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Porcine Retrovirus 
The present invention relates inter alia to porcine 
retrovirus (PoEV) fragments, in particular polynucleotide 
fragments encoding at least one porcine retrovirus expression 
product, a recombinant vector comprising at least one 
polynucleotide fragment, use of PoEV polynucleotide fragments in 
the detection of native porcine retrovirus, a host cell 
containing at least one PoEV polynucleotide fragment or a 
recombinant vector comprising at least one PoEV polynucleotide 
fragment, PoEV polypeptides, antibodies immuno-react ive with PoEV 
polypeptides, pharmaceutical compositions comprising recombinant 
PoEV polypeptides for use as prophylactic and/or therapeutic 
agents and uses of PoEV polynucleotide fragments and/ or 
polypeptides in. medicine, including veterinary medicine and in 
the preparation of medicaments for use in medicine, including 
veterinary medicine. 

Porcine retrovirus (PoEV) is an endogenous (genetically 
acquired) retrovirus isolated from pigs and expressed in cell 
lines derived from porcine material. There are no known 
pathogenic effects associated with the virus per se in its 
natural host although the virus appears to be associated with 
lymphomas in pigs and related viruses are associated with 
leukaemias and lymphomas in other species. The virus has been 
reported to infect cells from a variety of non-porcine origins 
and is, therefore, designated as a xenotropic, amphotropic or 
polytrophic virus (Lieber MM, Sherr C J . Benveniste RE and Todaro 
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GJ. 1975; Strandstrom H, Verjalainen P, Moening V, Hunsmann G, 
Schwarz H, and Schafer W. 1974; Todaro GJ, Benveniste RE, Lieber 
MM and Sherr CJ. 1974). The observation that the above viruses 
may have the potential to infect humans and have a pathogenic 
effect suggests that the issue of porcine retroviruses must be 
addressed in the context of xenotransplanting pig tissues or 
cells. Therefore, information on the properties of PoEV and the 
development of diagnostic reagents, molecular engineering tools 
and potential vaccine materials is of paramount importance for 
example in xenotransplantation technology and the like. 

It is an object of the present invention to obviate and/or 
mitigate against at least some of the above disadvantages. 

In one aspect the present invention provides an isolated 
PoEV polynucleotide fragment: 

(a) encoding at least one porcine retrovirus (PoEV) 
expression product; 

(b) encoding a physiologically active and/or immunogenic 
derivative of said expression product; or 

(c) which is complementary to a polynucleotide sequence as 
defined in (a) or (b) . 

Preferably, the polynucleotide fragment encodes the gag gene 
(gag), polymerase gene (pol) and/or envelope (env) gene of PoEV. 
Thus, said expression product can be the virion core polypeptides 
(GAG) and polymerase (POL) and/or envelope (ENV) polypeptides of 
PoEV. Thus, the invention further provides a recombinant PoEV 
virion core, polymerase and/or envelope polypeptide. 

"Polynucleotide fragment" as used herein refers to a chain 
of nucleotides such as deoxyribose nucleic acid ( DNA ) and 
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transcription products thereof, such as RNA. Naturally, the 
skilled addressee will appreciate the whole naturally occurring 
PoEV genome is not included in the definition of polynucleotide 
fragment. 

The polynucleotide fragment can be isolated in the sense 
that it is substantially free of biological material with which 
the whole genome is normally associated in vivo. The isolated 
polynucleotide fragment may be cloned to provide a recombinant 
molecule comprising the polynucleotide fragment. Thus, 
-polynucleotide fragment" includes double and single stranded 
DNA, RNA and polynucleotide sequences derived therefrom, for 
example, subsequences of said fragment and which are of any 
desirable length. Where a nucleic acid is single stranded then 
both a given strand and a sequence complementary thereto is 
within the scope of the present invention. 

in general, the term "expression product" refers to both 
transcription and translation products of said polynucleotide 
fragments. When the expression product is a "polypeptide" (i.e. 
a chain or sequence of amino acids displaying a biological and/or 
immunological activity substantially similar to the biological 
and/or immunological activity of PoEV virion core, polymerase 
and/or envelope protein) , it does not refer to a specific length 
of the product as such. Thus, the skilled addressee will 
appreciate that "polypeptide" encompasses inter alia peptides, 
polypeptides and proteins of PoEV. The polypeptide if required, 
can be modified in vivo and in vitro, for example by 
glycosylation, amidation, carboxy lat ion , phosphorylation and/or 
post-translational cleavage. 
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Polynucleotide fragments comprising portions encompassing 
the PoEV genome, and derived from retrovirus particles released 
from a reverse transcriptase-positive porcine kidney cell line 
PK-15, have been molecular ly cloned into a plasmid vector. This 
was achieved by synthesising cDNAs of PoEV RNA genomes which were 
recovered from porcine kidney cells expressing the endogenous 
virus. The cDNA was cloned into a plasmid vector and the 
isolated PoEV DNA fragment determined (see Figures 1,2 and 3). 
The sequence of the sequence identified in Figure 1 was the 
earliest determined sequence, followed by the sequence in Figure 
2 and lastly by the most recently revised sequence shown in 
Figure 3. An additional study has been carried out to determine 
whether or not the human cell line "Raji" was susceptible to 
infection with the PoEV present in porcine kidney cells (PK15) . 
A raji clone has now been obtained and the DNA sequence of its 
env gene region has been determined (see Figure 4). 

The DNA fragment of Figure 3 was shown to encode three open 
reading frames (ORFs) of 524, 1194 and 656 amino acids 
respectively. 

A comparison of the amino acid sequence against previously 
sequenced retroviruses from other species indicated that novel 
retrovirus cDNA had been cloned. Sequence analysis using the 
Lasergene software from DNASTAR Inc. showed that homologies were 
observed between the cloned PoEV DNA and the majority of 
retroviruses and that the closest homologies were to gibbon 
leukaemia virus (GaLV) in the polymerase (pal) and (env) regions 
of the pro-virus. 

The first open reading frame ORF of Figure 3 (nucleotides 588- 
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2162) is predicted to encode the PoEV virion core polypeptide 
(gag gene) . The second ORF (nucleotides 2163-5747) is predicted 
to encode the PoEV polymerase polypeptide (pol gene) . The third 
ORF (nucleotides 5620-7590) is predicted to encode the PoEV 
envelope polypeptide (env gene) . The skilled addressee will 
appreciate that it is possible to genetically manipulate the 
polynucleotide fragment or derivatives thereof, for example to 
clone the gene by recombinant DNA techniques generally known in 
the art and to express the polypeptides encoded thereby in vitro 
and/or in vivo . DNA fragments having the polynucleotide sequence 
depicted in Figures 1,2,3 and/ or 4 or DNA/RNA derivatives 
thereof, may be used as a diagnostic tool or as a reagent for 
detecting PoEV nucleic acid in nucleic acids from donor animals 

or as a vaccine. 

Preferred fragments of this aspect of the invention are 
polynucleotide fragments encoding: (a) at least one of the one 
to three polypeptides having an amino acid sequence which is 
shown in Figures 1,2,3 and/or 4 (b) encoding a polypeptide which 
is a physiologically active and/or immunogenic derivative of at 
least one of the polypeptides defined in (a); or (c) which is 
complementary to a polynucleotide sequence as defined above; or 
polynucleotide fragments: (a) comprising at least one of the ORFs 
shown in Figures 1,2,3 and/or 4 or comprising a corresponding RNA 
sequence; (b) comprising a sequence having substantial nucleotide 
sequence identity with a sequence as described in (a) above; or 
( c) comprising a sequence which is complementary to a sequence 
as described in (a) or (b) above. It is to be understood that 
the term "substantial sequence identity" is taken to mean at 
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least 50% (preferably at least 75% , at least 90%, or at least 
95%) sequence identity. 

The polynucleotide fragment of the present invention may be 
used to examine the expression and/or presence of the PoEV virus 
in donor animals and cells, tissues or organs derived from the 
donor animals to see if they are suitable for xenotransplantation 
(i.e. PoEV free). In addition, the recipients of pig cells, 
tissues or organs can be examined for the presence and/or 
expression of PoEV virus directly or by co-culture or infection 
of susceptible detector cells. 

A polynucleotide fragment of the present invention may be 
used to identify polynucleotide sequences within the PoEV genome 
which are PoEV specific (i.e. it is not necessary for the 
complete PoEV genome to be identified) . Such PoEV specific 
polynucleotide sequences may be used to identify PoEV nucleic 
acid in samples, such as transplanted cells, tissues or organs 
and may be included in a definitive test for PoEV. 

Thus, the present invention further provides an isolated 
PoEV polynucleotide fragment capable of specifically hybridising 
to a PoEV polynucleotide sequence. In this manner, the present 
invention provides probes and/or primers for use in ex vivo 
and/or in situ PoEV virus detection and expression studies. 
Typical detection studies include polymerase chain reaction (PCR) 
studies, hybridisation studies, or sequencing studies. In 
principle any PoEV specific polynucleotide sequence from the 
above identified PoEV sequence may be used in detection and/or 
expression studies . 

"Capable of specifically hybridising" is taken to mean that 
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said polynucleotide fragment preferably hybridises to a PoEV 
polynucleotide sequence in preference to polynucleotide sequences 
of other virus, animal (especially porcine or human sequences) 
and/or other species. In a preferment the PoEV fragment 
specifically binds to a native PoEV polynucleotide sequence or 

a part thereof. 

The invention includes polynucleotide sequence(s) which are 
capable of specifically hybridising to a PoEV polynucleotide 
sequence or to a part thereof without necessarily being 
completely complementary to said PoEV polynucleotide sequence or 
fragment thereof. For example, there may be at least 50% 
preferably at least 75%. most preferably at least 30% or at least 
9 S% complementarity. Of course, in some cases the sequences may 
be exactly complementary (100% complementary, or nearly so (e.g. 
there may be less than 10, preferably less than 5 mismatches). 
Thus, the present invention also provides anti-sense or 
complementary nucleotide sequence(s) which is/are capable of 
specifically hybridising to the disclosed DNA sequence. If a 
POEV specific polynucleotide is to be used as a primer in PC* 
and/or sequencing studies, the polynucleotide must be capable of 
hybridising to PoEV nucleic acid and capable of initiating chain 
extension from 3' end of the polynucleotide, but not able to 
correctly initiate chain extension from non PoEV sequences 
(especially from human, or non-PoEV porcine sequences). 

If a POEV specific test polynucleotide sequence is to be 
used in hybridisation studies, to test for the presence of PoEV 
nucleic acid in a sample, the test polynucleotide should 
preferably remain hybridised to a sample polynucleotide under 
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stringent conditions. If desired, either the test or sample 
polynucleotide may be immobilised. Generally the test: 

polynucleotide sequence is at least 10 or at least 50 bases in 
length. It may be labelled by suitable techniques known in the 
art. Preferably the test polynucleotide sequence is at least 200 
bases in length and may even be several kilobases in length. 
Thus, either a denatured sample or test sequence can be first 
bound to a support . Hybridization can be effected at a 
temperature of between 50 and 70°C in double strength SSC (2xNaCl 
17.5g/l and sodium citrate (SC) at 8.8g/l) buffered saline 
containing 0.1% sodium dodecyl sulphate (SDS). This can be 
followed by rinsing of the support at the same temperature but 
with a buffer having a reduced SSC concentration. Depending upon 
the degree of stringency required, and thus the degree of 
similarity of the sequences, such reduced concentration buffers 
are typically single strength SSC containing 0.1%SDS, half 
strength SSC containing 0.1%SDS and one tenth strength SSC 
containing 0.1%SDS. Sequences having the greatest degree of 
similarity are those the hybridisation of which is least affected 
by washing in buffers of reduced concentration. It is most 
preferred that the sample and inventive sequences are so similar 
that the hybridisation between them is substantially unaffected 
by washing or incubation in one tenth strength sodium citrate 
buffer containing 0.1%SDS. 

PoEV specific oligonucleotides may be designed to 
specifically hybridise to PoEV nucleic acid. They may be 
synthesised, by known techniques and used as primers in PCR or 
sequencing reactions or as probes in hybridisations designed to 
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detect the presence of PoEV material in a sample. The 
oligonucleotides may be labelled by suitable labels Known in the 
art, such as, radioactive labels, chemiluminescent labels or 
flU orescent labels and the like. Thus, the present invention 
also provides PoEV specific oligonucleotide probes and primers. 

The term "oligonucleotide" is not meant to indicate any 
particular length of sequence and encompasses nucleotides of 
preferably at least 10b (e.g. 10b to 1Kb) in length, more 
preferably 12b-500b in length and most preferably 15b to 100b. 

The POEV specific oligonucleotides may be determined from 

^k«,,t, in Fiaure 1 and may be manufactured 
the PoEV sequences shown in figure ± an j 

according to known techniques. They may have substantial 
sequence identity (e.g. at least 50%, at least 75%, at least 90% 
or at least 95% sequence identity) with one of the strands shown 
therein or an RNA equivalent, or with a part of such a strand. 
Preferably such a part is at least 10, at least 30, at least 50 
or at least 200 bases long. It may be an ORE or a part thereof. 

Oligonucleotides which are generally greater than 30 bases 
in length should preferably remain hybridised to a sample 
polynucleotide under one or more of the stringent conditions 
mentioned above. Oligonucleotides which are generally less than 
30 bases in length should also preferably remain hybridised to 
a sample polynucleotide but under different conditions of high 
stringency. Typically the melting temperature of an 

oligonucleotide less than 30 bases may be calculated according 
to the formula of; 2°C for every A or T, plus 4°C for every G or 
C, minus 5-C. Hybridisation may take place at or around the 
calculated melting temperature for any particular 
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oligonucleotide, in 6 x SSC and 1% SDS. Non specifically 
hybridised oligonucleotides may then be removed by stringent 
washing, for example in 3 x SSC and 0.1% SDS at the same 
temperature. Only substantially similar matched sequences remain 
hybridised i.e. said oligonucleotide and corresponding PoEV 
nucleic acid. 

When oligonucleotides of generally less than 30 bases in 
length are used in sequencing and/ or PCR studies, the melting 
temperature may be calculated in the same manner as described 
above. The oligonucleotide may then be allowed to anneal or 
hybridise at a temperature around the oligonucleotides calculated 
melting temperature. In the case of PCR studies the annealing 
temperature should be around the lower of the calculated melting 
temperatures for the two priming oligonucleotides. It is to be 
appreciated that the conditions and melting temperature 
calculations are provided by way of example only and are not 
intended to be limiting. It is possible through the experience 
of the experimenter to vary the conditions of hybridisation and 
thus anneal/hybridise oligonucleotides at temperatures above 
their calculated melting temperature. Indeed this can be 
desirable in preventing so-called non-specific hybridisation from 
occurring . 

It is possible when conducting PCR studies to predict an 
expected size or sizes of PCR product(s) obtainable using an 
appropriate combination of two or more PoEV oligonucleotides, 
based on where they would hybridise to the sequence in Figure 1. 
If, on conducting such a PCR on a sample of PoEV DNA, a fragment 
of the predicted size is obtained, then this is predictive that 
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the DNA is PoEV. 

The present Invention also encompasses PoEV detection kits 
including at least one oligonucleotide which is PoEV specific. 
a s well as any necessary reaction reagents, washing reagents, 
detection reagents, signal producing agents and the like for use 
in the test formats outlined above. 

In a further aspect there is also provided use of a PoEV 
specific polynucleotide in the detection of PoEV in a sample. 

in a yet further aspect there is provided use of a PoEV 

-, 4-^~ in =, pcr for the detection of PoEV in a 
specific polynucleotide in a PCR ror w 

sample. 

The skilled addressee will appreciate how polynucleotide 
fragments may be designed and used as primers/probes in 
polymerase chain reaction (PCR) experiments or southern analysis 
(i e . hybridisation studies) for detecting the presence or 
otherwise of PoEV polynucleotide in the nucleic acid of pigs or 
in cell, tissue or organ samples taken from pigs (e.g. from 
potential transplant organs such as liver, kidney and heart,, 
such cells, tissues or organs can be derived from transgenic 
animals produced as described in EP-A-0493852 , or by other means 
known in the art. Thus the cells, tissues or organs of 
transgenic pigs can be associated with one or more homologous 
complement restriction factors active in humans to prevent /reduce 

activation of complement. 

Furthermore the polynucleotide fragments of the present 
invention can be used to analyze the genetic organisation of 
endogenous PoEV located in the animal cell genome in pigs thus 
permitting the ser.en.ng of herds of Pl gs for altered proves 
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and genomic loci (e.g. non-expressed provirus loci). Such a 
screening method would facilitate, for example, screening in a 
population of animals which are bred to lack expressed provirus 
and genomic loci and/or loci that do not encode infectious virus 
particles . 

Reagents may also be developed from said polynucleotide 
fragments as aids to develop pigs that do not express an 
infectious, PoEV capable of infecting humans. Such pigs could 
still contain partial defective genomes that could result in the 
expression of non-infectious particles, viral proteins or viral 
mRNA. Alternatively, it may be possible to use constructs 
derived from the PoEV polynucleotide sequence to act as 
insertional mutagens to knockout the productive infectious PoEV 
in embryos, embryonic stem cells, or cells containing 
totipotential nuclei capable of forming a viable embryo. Thus 
gag, poi and/or env gene "knockouts" may be constructed to allow 
development of breeding programmes in pigs whereby endogenous 
PoEV is substantially prevented or reduced. For example the 
nucleotide sequence of PoEV can be manipulated e.g. by deletion 
of a coding sequence in vitro and the resulting construct used 
to replace the natural PoEV sequence by recombination. Thus, the 
proviral genome can be rendered inactive in the porcine cells. 
The knockouts can be manipulated into embryos and/or stem cells 
and if required manipulated nuclei can be transferred from target 
cells to germ cells using micromanipulation techniques well known 
in the art. The invention also extends to animals derived from 

such germ cells. 

Thus, transgenic pigs may be produced containing anti-sense 



PCT/GB97/01087 

WO 97/40167 



13 



instructs and/or ribozyme constructs capable of downregulating 
the expression of viral proteins, or transgenic pigs expressing 
a single chain immunoglobulin molecule with specificity for PoEV 
proteins or other protein that might interfere with protein 
synthesis or viral assembly may also be produced. similar 
transgenes encoding trans-dominant negative regulators of PoEV 
expression or transgenes encoding competative defective -genomic 
WAS" may be used to reduce or eliminate the production of 
infectious virions. The generation of reagents to suppress the 
expression of native PoEV loci in pigs, such as, by generation 
of antisense nucleic acids (e.g. antisense mRHAs, , ribozymes or 
other antiviral reagents may also be developed. 

The polynucleotide fragment can be molecularly cloned into 
a prokaryotic or eukaryotic expression vector using standard 
techniques and administered to a host. The expression vector is 
taken up by cells and the polynucleotide fragment of interest 
expressed, producing protein. Presentation of the protein on 
cell surface stimulates the host immune system to produce 
antibodies immunoreactive with said protein as part of a defence 
mechanism. Thus, expressed protein may be used as a vaccine. 

inactivated vaccines can be produced from PoEV's or cells 
releasing PoEV . Such infected cells may be generated by natural 
infection or by transfection of a proviral clone of PoEV. It 
will be understood that a proviral clone is a molecular clone 
encoding on at least one antigenic polypept.de of PoEV After 
harvesting the virus and/or the infected cells, viruses or 
infected cells present can be inactivated for example, with 
formaldehyde, glutera Idehyde , acetylethylenimine or other 
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suitable agent or process to generate an inactivated vaccine 
using methods commonly employed in the art. (CVMP Working Party 
on Immunological Veterinary Medicinal Products (1993) . General 
requirements for the production and control of inactivated 
mammalian bacterial and viral vaccines for veterinary use) . Sub 
unit vaccines may be prepared from the individual proteins 
encoded by the gag, pol and env genes. Typically a vaccine would 
contain env gene products either alone or in combination with gag 
genes produced by expression in bacteria, yeast or mammlian cell 
systems . 

Proviral clones of PoEV can be engineered to develop single 
cycle or replication defective viral vectors suitable for 
vaccination using techniques. Such viral vectors known in the 
art (e.g. MuLV Murine Leukaemia Retrovirus, Adenovirus and 
Herpesviruses (Anderson WF. (1992) . Human Gene Therapy. Science 
25 6 , 808-813) may have one or more genes essential for 
replication deleted, with the missing gene function expressed 
constitutively or conditionally from a further, different 
construct which is integrated into the chromosomal DNA of a 
complementing cell line to the proviral PoEV clone. PoEV virions 
released from the cell line may infect secondary target cells in 
the vaccinee but not produce further infectious virus particles. 
For instance, the polynucleotide sequence encoding the reverse 
transcriptase domain of pol can be deleted from the proviral PoEV 
clone and the reverse transcriptase domain of pol integrated into 
the complementing cell line. 

It will be understood that the polynucleotides; 
polypeptides; PoEV free cells, tissues and/or organs encompassed 
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by the present invention could be used in therapy, diagnosis, 
and/or methods of treatment. The polynucleotides; polypeptides ; 
POEV free cells, tissues and/ or organs encompassed by the present 
invention can also be used in the preparation of medicaments for 
use in therapy or diagnosis. 

The cloning and expression of a recombinant PoEV 
polynucleotide fragment also facilitates in producing anti-PoEV 
antibodies and fragments thereof (particularly monoclonal 
antibodies) and evaluation of in vitro and in vivo biological 
activity of recombinant PoEV polymerase and/or envelope 
polypeptides. The antibodies may be employed in diagnostic tests 

for native PoEV virus. 

It will be understood that for the particular PoEV 
polypeptides embraced herein, natural variations can exist 
between individuals or between members of the family Suidae (i.e. 
the pig family). These variations may be demonstrated by (an) 
amino acid dif f erence (s) in the overall sequence or by deletions, 
substitutions, insertions, inversions or additions of (an) amino 
acid(s) in said sequence. All such derivatives showing active 
polymerase and/or envelope polypeptide physiological and/or 
immunological activity are included within the scope of the 
invention. For example, for the purpose of the present invention 
conservative replacements may be made between amino acids within 
the following groups: 

(I) Alanine, serine, threonine; 

(II) Glutamic acid and aspartic acid; 

(III) Arginine and leucine; 

(IV) Asparagine and glutamine; 



WO 97/40167 FCT/GB97/01087 

16 

(V) Isoleucine, leucine and valine; 

(VI) Phenylalanine, tyrosine and tryptophan 

Moreover, recombinant DNA technology may be used to prepare 
nucleic acid sequences encoding the various derivatives outlined 
above . 

As is well known in the art, the degeneracy of the genetic 
code permits substitution of bases in a codon resulting in a 
different codon which is still capable of coding for the same 
amino acid, e.g. the codon for amino acid glutamic acid is both 
GAT and GAA . Consequently, it is clear that for the expression 
of polypeptides with the amino acid sequences shown in Figure 1 
or fragments thereof, use can be made of a derivative nucleic 
acid sequence with such an alternative codon composition 
different from the nucleic acid sequence shown in said Figure 1. 

Furthermore, fragments derived from the PoEV core, 
polymerase and/or envelope polypeptides as depicted in Figure 3, 
which still display PoEV virus core polypeptide, polymerase 
and/or envelope polypeptide properties, or fragments derived from 
the nucleic acid sequence encoding the virus core polypeptides, 
polymerase and/or envelope polypeptides or derived from the 
nucleotide sequence depicted in Figures 1,2,3 and/or 4encoding 
fragments of said virus core polypeptide, polymerase and/or 
envelope polypeptides are also included of the present invention. 
Naturally, the skilled addressee will appreciate within the ambit 
that the said fragments should substantially retain the 
physiological and/or immunological properties of the GAG, POL 
and/or ENV polypeptides. 

The PoEV polynucleotide fragment of the present invention 
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is preferably linked to regulatory control sequences. Such 
control sequences may comprise promoters, operators, inducers, 
enhancers, ribosome binding sites, terminators etc- Suitable 
control sequences for a given host may be selected by those of 
ordinary skill in the art. 

A polynucleotide fragment according to the present invention 
can be ligated to various expression controlling sequences, 
resultinq in a so called recombinant nucleic acid molecule. 
Thus, the present invention also includes an expression vector 
containing an expressible PoEV nucleic acid molecule. The 
recombinant PoEV nucleic acid molecule can then be used for the 
transformation of a suitable host. Such hybrid molecules are 
preferably derived from, for example, plasmids or from nucleic 
acid sequences present in bacteriophages or viruses and are 
termed vector molecules. 

Specific vectors which can be used to clone nucleic acid 
sequences according to the invention are known in the art (e.g. 
Rodriguez, R.L. and Denhadt, D.T., Edit., Vectors: a survey of 
molecular cloning vectors and their uses, Butterworths , 1988). 

The methods to be used for the construction of a recombinant 
nucleic acid molecule according to the invention are known to 
those of ordinary skill in the art and are inter alia set forth 
in Sambrook, et al . (Molecular Cloning: a laboratory manual Cold 
Spring Harbour Laboratory, 1989) . 

The present invention also relates to a transformed cell 
containing the PoEV polynucleotide fragment in an expressible 
form. "Transformation", as used herein, refers to the 

introduction of a heterologous polynucleotide fragment into a 
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host cell. The method used may be any known in the art, for 
example, direct uptake, transfection transduction or electro 
poration (Current Protocols in Molecular Biology, 1995. John 
Wiley and Sons Inc) . The heterologous polynucleotide fragment 
may be maintained through autonomous replication or 
alternatively, may be integrated into the host genome. The 
recombinant nucleic acid molecules preferably are provided with 
appropriate control sequences compatible with the designated host 
which can regulate the expression of the inserted polynucleotide 
fragment, e.g. tetracycline responsive promoter, thymidine kinase 
promoter, SV-40 promoter and the like. 

Suitable hosts for the expression of recombinant nucleic 
acid molecules may be prokaryotic or eukaryotic in origin. Hosts 
suitable for the expression of recombinant nucleic acid molecules 
may be selected from bacteria, yeast, insect cells and mammalian 
cells. 

Since the biological half life and the degree of 
glycosylation of recombinant PoEV virus core polypeptide, 
polymerase and/or envelope polypeptides may be important for use 
in vivo, yeast and baculovirus systems, in which a greater degree 
of processing and glycosylation occur, are preferred. The yeast 
strain Pichia Pastoris exhibits potential for high level 
expression of recombinant proteins (Clare et al., 1991). The 
baculovirus system has been used successfully in the production 
of type 1 interferons (Smith et al., 1983). 

Embodiments of aspects of the present invention will now be 
described by way of example only which are not intended to be 
limiting thereof. 
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F.vample 1 

Preparation of viral RNA 

500ml of supernatant derived from exponentially growing porcine 
kidney cells (PK-15, American Type Culture Collection CCL 33) was 
clarified by centrif ugation of approximately ll.OOOxg for 10 
minutes. Virus was pelleted from the clarified supernatant by 
centrif ugation at approximately I00,oooxg for 60 minutes. The 
supernatant was discarded and the viral pellet retained for the 
preparation of viral RKA genomes. RNA was prepared from the 
virus pellet using a Dynabeads (registered trade mark) mRNA 
Direct Kit according to the manufacturer's protocols; A PoEV 
virus pellet was resuspended in 500^1 of TNE (lOmM Tris HCl 
P H8.0, 0.1M Nad. 1»M EDTA) and the virions disrupted by the 
addition of 2ml of lysis/binding buffer. Dynabeads Oligo(dT) 25 
were conditioned according to the manufacturer's instructions and 
added to the virus disrupted solution. Viral RNA was allowed to 
bind to the Dynabead for 10 minutes before the supernatant was 
removed and the bound RNA was washed three times with washing 
buffer with LiDS (0.5ml) and twice with washing buffer alone. 
The RNA was finally resuspended in 25 M l of elution solution. 
All procedures were performed at ambient temperature. RNase 
contamination was avoided by the wearing of gloves, observation 
of sterile technique and treatment of solutions and non- 
disposable glass and plasticware with diethyl pyrocarbonate 
(DEPC). The RNA was resuspended in DEPC- treated sterile water. 
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Example 2 
Synthesis of cDNA 

cDNA was synthesised from the purified genomic RNA using Great 
Lengths ™ cDNA amplification reverse transcriptase reagents 
(Clontech Laboratories Inc.) following the manufacturer's 
instructions. The RNA was primed with both oligo(dT) and random 
hexamers to maximise synthesis . 

The Great Lengths cDNA synthesis protocol is based on a modified 
Gubler and Hoffman (1983) protocol for generating complementary 
DNA libraries and essentially consists of first-strand synthesis, 
second strand synthesis, adaptor ligation, and size 
f ractionact ion . 

First strand synthesis: lock-docking primers anneal to the 
beginning of the poly-A tail of the RNA due to the presence of 
A, C or a residue at the 3'-end of the primer. This increases 
the efficiency of cDNA synthesis of eliminating unnecessary 
reverse transcription of long stretches of poly-A. In addition, 
the reverse transcriptase used is MMLV (RNase H ) which gives 
consistently better yields than do wild-type MMLV or AMV reverse 
transcriptase . 

Second strand synthesis: the ratio of DNA polymerase I for 
RNase H has been optimised to increase the efficiency of the 
second strand synthesis and to minimize priming by hair pin loop 
formation. Following secoond-s tr and synthesis, the ds cDNA is 
treated with T4 DNA polymerase to create blunt ends. 
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Adaptor ligation: the cDNA is ligated to a specially 
designed adaptor that has a pre-existing EcoRI "sticky end". The 
use of this adaptor, instead of a linker, eliminates the need to 
m ethylate and the EcoRI - digest the cDNA , and thus leaves 
internal EcoRI, sites intact. The adaptor is 5'-phosphorylated 
at the blunt end to allow efficient ligation to the blunt-ended 
cDNA. 

Size fractionation: the ds cDNA is phosphorylated at the 
EcoRI sites and size-fractionated to remove unligated adaptors 
and unincorporated nucleotides. The resulting cDNA is ready for 
cloning into a suitable EcoRI -digested vector. 



Fvample 3 

Molecular cloning of cDNA 

The size fractionated fragment was ligated with EcoR I- digested 
pZErO™ -1 plasmid vector DNA (Invitrogen Corporation, San 
Diego, U.S.) . The ligation mix was used to transform competent 
TOPlOF'cells and these were plated onto L-Agar containing zeocin 
following the manufacturer's instructions (Zero Background- 
cloning kit - invitrogen) . Several of the resulting zeocin 
resistant colonies were amplified in L-Broth containing zeocin 
and the plasmid DNA was purified by alkaline lysis (Maniatis et 
al . , 1982) . 

The plasmid DNA was digested to completion with the 
endonuclease EcoR I and the resulting DNA fragments were 
separated by electrophoresis through an 1.0% agarose gel 
(Maniatis et al . , 1982), in order to check that a fragment in the 
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predicted size fractionated size range had been cloned. A clone 
identified as pPoEV was used in further experimentation. 
Kvample 4 

DNA sequence analysis. 

pPoEV plasmid DNA was purified according to common techniques 
(Sambrook et al, 1989) and sequenced using an ABI automated 
sequencer. Overlapping sequencing primers from both strands of 
the molecular clone were used to determine the nucleotide 
sequence. 

The first sequence obtained is shown in Figure 1. This 
sequence was identified as encoding two ORFs of 924 (nucleotides 
23-2793) and 218 (nucleotides 2642-3297) amino acids, relating 
to the pol and env genes respectively. This sequence was revised 
and updated to the second sequence as shown in Figure 2. This 
second sequence was identified as encoding three ORFs of 516 
(nucleotides 576-2126), 1186 (nucleotides 2143-5733) and 656 
(nucleotides 5606-7576) amino acids, encoding the PoEV gag, pol 
and env genes respectively. This second sequence has since been 
revised and updated to the sequnce shown in Figure 3. This third 
sequence was identified as encoding three ORFs of 524 
(nucleotides 588-2162), 1194 (nucleotides 2163-5747) and 656 
(nucleotides 5620-7590) amino acids, encoding the PoEV gag, pol 
and env genes respectively. 

The differences in the disclosed seqeunces is reflected by 
improvements in carrying out and analysing the sequence obtained. 
However, there is 100% identity at the nucleic acid level, 
between positions 21-2681 of the first sequence and positions 
2972-5653 of the third sequence. Overall there is a 70.5% 
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identity in the entire 3310 bp of the first sequence with a 
corresponding portion of the third sequence. 

There are only 3 base changes between the second sequence 
and the third sequence. These are as follows: 
hase no. ffr-om Figure 2) chanqe 
2121 insertion of a "G" 

2157 insertion of a "G" 

5902 "R" to an "A" 

?700 «M« to an "A" 

The changes at base nos . 5902 and 7700 do not effect the 
corresponding amino acid sequence. However, the changes at 
positions 2121 and 2157 alter the amino acid sequence at the end 
of GAG and the begining of POL. For GAG the final amino acid "S» 
have now been replaced by "VLALEEDKD" . The total product size 
is now 524 amino acids. For POL, the first five amino acids 
"RLGET" have been deleted and replaced by "GRR" . The total 
product size is now 1194 amino acids. 

Similarities were observed between pPoEV and the majority 
of retroviruses determined by using alogrithims from DNASTAR Inc. 
Lasergene software (DNASTAR). The similarities were closest with 
gibbon ape leukaemia virus (GaLV) in the polymerase (pal) 
regions of the pro-virus at 68.5%, in the virus core (gag) 
region, 59.2% and in the envelope (env) region, 39.3% The 
nucleotide sequence and major ORFs of the pPoEV insert are shown 
in Figure 3. The largest ORF (nucleotides 2163-5747) encodes the 
polymerase polypeptide and the smaller ORFs (nucleotides 588-2162 
and 5620-7590) encode the core and envelope polypeptides 
respectively . 
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Example 5 

Purification of cellular DNA from cultured cells, tissues and 
blood. 

Cultured cells 

Cells were maintained in culture and approximately 5 x 10 7 cells 
were harvested for DNA preparation. The cells were pelleted by 
centrif ugation resuspended in phosphate-buffered saline, 
re-centrif uged at lOOOg for 2 minutes and the supernatant was 
discarded . 

Porcine tissues 

Porcine tissue samples were frozen in liquid nitrogen and 
powdered by grinding in a mortar or between metal foil. The 
samples were resuspended in 5ml of extraction buffer consisting 
of 0.025M EDTA (pH 8.0), O.OlMTris.Cl pH 8.0, 0.5% SDS 20^g/ml 
RNAse and 100/xg/ml proteinase K (Maniatis et al . , 1982). 

Porcine blood 

A buffy coat was prepared from the blood samples. 20ml samples 
were centrifuged at lOOOg for 15 minutes. The buffy coat was 
resuspended in buffer and the samples centrifuged at lOOOg for 
15 minutes. The process was repeated one further time. The 
sample was mixed with 5ml (3x volume) of extraction buffer 
(Maniatis et al . , 1982). 

Purification 

The samples (i.e. cultured cells, porcine tissue or porcine blood 
cells) in proteinase K-extraction buffer containing 20^xg/ml RNAse 
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and lOO^g/ml proteinase K were digested for approximately 24 
hours at 37»C. The deproteinised DNA was extracted twice with 
phenol and twice with phenol chloroform and finally precipitated 
by ethanol in the presence of ammonium acetate. The DNA was 
recovered by centrif ugat ion at 3000g for 30 minutes and the 
supernatant discarded (Maniatis et al . , 1982). The pellet was 
washed in 70% ethanol and allowed to air dry for approximately 
1 hour. The DNA was allowed to re-dissolve in Tris EDTA (TE) 
buffer and the purity and concentration of the DNA was assessed 
by spectrophotometry (Maniatis et al . , 1982). 

Fvam ple 6 

Southern blot analysis of porcine tissue and cells 

in order to demonstrate that the molecularly cloned DNA 
comprising the insert from PoEV was derived from the PK-15 cell 
line (American Type Culture Collection CCL33), the DNA was 
hybridised against cellular DNAs and its ability to detect 
proviral DNA was examined. 

DNA purified from oPoEV was radioact ively labelled and used to 
probe a Southern blot of endonuclease digested DNAs derived from 
PK-15 cells . 

The DNAs probed were as follows : 

a) Copy number controls of pPoEV DNA linearized by digestion 
with EcoRI. One copy per haploid cell genome was estimated 
to be 6.84pg. The control was present at an estimated copy 
number of 1 , 5 and 10 copies. 

b) PK-15 DNA. 

t — » rr>m 0 v-i,~3n TvDe Culture Collection 

c) Negative control HeLa (American iype 



WO 97/40167 PCT/GB97/01087 

26 

CCL2 ) DNA derived from a human adenocarcinoma cell line 
harbouring human papillomavirus type 18 DNA. 
d) Negative control SP20 ( European Collection of Animal Cell 
Cultures 85072401) DNA derived from a murine myeloma cell 
line harbouring a xenotropic MuLV retrovirus. 

A hybridisation signal was observed in only the PK-15 porcine 
DNA. No signal was detected in either the negative human or 
murine DNAs . The PK-15 DNA contained more than 10 copies per 
cell with an estimated copy number of 20. The sizes of the 
three major EcoRI- endonuclease digested DNA fragments were 
approximately 3.8Kb, 1.8kb and 0.6kb. The sizes of relevant 
fragments detected in the recombinant pPoEV were comparable. 

There are, as expected, a number of fragments common to the 
genomic DNA of PK-15 and pPoEV DNA and there is agreement 
between the patterns observed in both DNAs digested with Xhol, 
BamHI and Hindlll- However, there are additional fragments 
obtained on digestion of pPoEV DNA by a number of endonucleases . 

pPoEV sequences were also detected in swine testes (American Type 
Culture Collection CRL 1746) and primary porcine kidney cells 
(Central Veterinary Laboratory batch C04495) but not in hamster 
CHOK1 (American Type Culture Collection CCL61) or murine NS0 
myeloma cells (European Collection of Animal Cell Cultures 
85110503) . 
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in order to demonstrate that the molecularly cloned DNA 
comprising the insert from pPoEV could detect sequences in 
porcine cells and tissues in addition to PK-15 the pPoEV DNA was 
hybridised against cellular DNA from tissues derived from pigs 
and its ability to detect proviral DNA was examined (Maniatis et 
al., 1982). 

The DNA purified from pPoEV was radioactively labelled and used 
to probe a Southern blot of endonuclease digested DNAs derived 
from pig organs including liver, Kidney, heart and blood. 

The DNAs probed were as follows : 



a) 



b) 



copy number controls of pPoEV DNA Unearned by digestion 
with EcoRI. One copy per haploid cell genome was estimated 
to be 6.84 P g. The control was present at an estimated copy 
number of 5,10, 20 and 50 copies. 

DNA purified from the porcine tissues digested with 
EcoRI . 



A hybridisation signal was observed in all the porcine DNAs 

The DNAs contained less than 5 copies per cell- There were 
approximately eight distinct bands in each DNA. The sizes of 
the three major endonuclease digested DNA fragments were 
approximately 3. 8Kb, 1.8kb and 0.6kb. 
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Example 7 

Polymerase Chain Reaction (PCR) Amplifications 

Oligonucleotides were selected from the PoEV genome. 



The upstream primer was 5 ' -GGA AGT GGA CTT CAC TGA G-3 ' . 

The downstream primer was 5' -CTT TCC ACC CCG AAT CGG -3'. 

The PCR was performed as described by Saiki et al (1987). One 
1 M 1 of 100ng/fxl template DNA was added to a 49/il reaction mixture 
containing 200 M M of dATP , dCTP, dGTP , dTTP , 30pmol of both 
primers from the pair described above, lunit of DNA polymerase 
and Sy.1 of reaction buffer. The reaction buffer contained 200mM 
Tris-HCl pH 8.4, 500mM potassium chloride and 15mM magnesium 
chloride, ultrapure water. The solution was overlaid with two 
drops of mineral oil to prevent evaporation. Thirty five cycles 
of amplification were performed using a Perkin Elmer Cetus 
thermal cycler. Each cycle consisted of 1 minute, at 95°C to 
denature the DNA, 1 minute, at 53°C to anneal the primers to the 
template and 1 minute, at 72°C for primer extension. After the 
last cycle a further incubation for 10 minutes, at 72°C was 
performed to allow extension of any partially completed product. 
On completion of the amplification, 10^1 of the reaction mixture 
was electrophoresed through a 5 per cent acrylamide gel. The DNA 
was visualised by staining with ethidium bromide and exposure to 
ultraviolet light (320nm). 
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The PCR reaction amplified a sequence cf approximately 787bp 
fro* pPoEV and fro* porcine cells as expected indicating that the 
assay detected the PoEV proviral DNA. There was no specific 
amplification of the expected sequence in cells of non-porcine 
origin and therefore, the PCR reaction and recombinant clone can 
he used as a specific and sensitive diagnostic tool for detection 
of PoEV . 

Two further oligonucleotides were designed against the 3 ' end 
of the pol gene and s< end of the gag gene respectively. 

n <-i*e> S'-GAT GGC TCT CCT GCC CTT TG-3 ' 

The 3' pol oligonucleotide was 5 

^.-j c,.ra TGG AGG CGA AGC TTA AGG— 3 ' 

The 5' gag oligonucleotide was 5 —CGA 

The above oligionucleotide were a!so used in in PCR reactions 
according to the conditions described above, with the exceptions 
that the annealing temperature was 58- and 30 cycles of 
replication were carried out. The PCR reaction amplified a 
seguence of approximately 468bp from PPOEV and from porine cells. 

Example 8 

Production of PoEV polypeptide in Escherichia coli. 
The open reading frame (ORF) encoding the pol peptide was 
isolated from the P PoEV clone and molecularly cloned into the 
plasmid pGEX-4T-l (Pharmacia Ltd.) for expression. 

Two ml cultures of E . coli transformed with various expression 
constructs were grown with shaking at 37°C to late log phase 
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(0-D-6ooom of °- 6 > and induced by the addition of IPTG to 0.1 mM. 
Induced cultures were then incubated for a further 2 hours after 
which the bacteria were collected by centrif ugation. The 
bacterial pellet was lysed by boiling in SDS-PAGE sample buffer 
and the protein profile of the induced bacteria was analysed on 
a 12% acrylamide gel (Laemmli, 1970) followed by staining with 
coomassie brilliant blue dye. 

Example 9 

Isolation and partial sequencing of Ra ii clone 

The aim of the study was to determine whether the human cell line 
"Raji" was susceptible to infection with the PoEV present in 
porcine kidney cells (PK15) . In order to test the capacity of the 
virus for xenotropism, PK15 cells were co-cultured with the B 
lymphoblastoid (Raji) cell line over 20 passages. 

The culture system utilised direct culture and transwells, which 
separated the human and porcine cells, but permitted viruses to 
pass through the separating membrane. After every fifth passage, 
supernatants from the human cell lines are tested for the 
presence of retrovirus by reverse transcriptase assay. 

Cell cultures 

Porcine kidney (PK15) cells (ATCC CCL 33) were used as the source 
of PoEV. The human cells used for co-cultivation with PK15 cells 
were the lymphoblast- 1 ike Burkitts lymphoma Raji (ATCC CCL 86) 
cell line. This cell line does not harbour endogenous 
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retroviruses and lacks reverse transcriptase activity when tested 
by the present inventors. 

Co-cultivation 

Raji cells were co-cultivated directly with PK15 cells in 
duplicate 80cm 2 flasks and exposed to the PK15 cells throughout 
the 20 passage culture period. The cells were passaged twice per 
week and PK15 cells added as necessary from a stock culture. At 
every fifth passage a sample of Raji cells was removed from the 
co-culture, washed and cultured for 3-4 days. Supernatant was 
then harvested and tested for presence of retrovirus by reverse 
transcriptase (RT) assay. 



RESULTS 

The presence of reverse transcriptase activity with a preference 
for the Mn 2 + cation in the supernatant from detector cell 
cultures is suggestive of infection by porcine retrovirus. 
Reverse transcriptase activity with preference for the Mn 2+ 
template was not detected in the duplicate co-cultivated test 
cultures at passage 5 but was detected at passages 10, 15 and 20. 
No significant RT activity was detected in the negative control 
cultures. RT activity with preference for the Mn^ template was 
detected in positive control cultures at passage 5 and 20. 
An infected raji culture was diluted to single cells, and then 
a selection of cells cultured separately such that each culture 
originated from one cell. Each culture was tested by reverse- 
transcriptase assay. 
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Genomic DNA was made from an RT-positive clone as described in 
example 5 -purification. The PoEV ENV region was amplified by PCR 
as described below and the product molecularly cloned into pMOS 
blue T-vector (Amersham) . This molecular clone was then sequenced 
(Fig. 4) . 

PCR 

Oligonucleotides were selected from the PoEV genome. 

The upstream primer was 5' -GAT GGC TCT CCT GCC CTT TG -3' 
5' base position: 5240 

The downstream primer was 5 '-CCA CAG TCG TAC ACC ACG -3' 
5' base position: 8144 

Expected product size: 2904bp 

Approx. 1 Mg of genomic raji clone DNA was added to a 50 /il 
reaction mixture containing 200 of dATP , dCTP, dGTP , dTTP , 

30pM each primer detailed above, lu Taq DNA polymerase and 5jxl 
reaction buffer. The reaction buffer contained 200mM Tris.Cl pH 
8.4, 500mM potassium chloride, 15mM magnesium chloride and 
ultrapure water. The solution was overlaid with two drops of 
mineral oil to prevent evaporation. Thirty cycles of 
amplification was performed followed by an elongated extension 
reaction of 60min. at 72°C. 
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The cycles consisted of: 
95°C 1 min. 
56°C 1 min. 
72°C 2 min. 

^ 4- visualised as described in example 7. 

The PCR product was visuaiiseu 

Product size: -3Kb. 
CLONING 

The PCR product was molecularly cloned into pMOS-Blue T-vector 
as directed by the manufacturer (pMOS-Blue T-vector Kit - 
Amersham) . 

20 transformed colonies (clones) were picked and added to 5xnls 
L -broth containing 50 ampicillin. The cultures were grown 

shaking at 37°C overnight. Plasmid DNA was isolated from each 
clone using the perfect prep plasmid isolation kit as directed 
by the manufacturer (5 Prime - 3 Prime Inc. Boulder, CO, USA). 

Plasmid DNA was digested to completion with the endonucleases 
EcoRI and HindHI and the products visualised on an ethidium 
bromide-stained 1% agarose gel. A clone (raji env clone) showing 

^_ -i-k=.+- nrprticted for 'PK15 cell line 
the same banding pattern as that predicrea 

derived PoEV , was selected for sequencing. 
SEQUENCING 

Raji env clone plasmid DNA prepared above was sequenced using an 
A BI automated sequencer, and the commercially avaxlableT7 
sequencing primer. 
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The entire env gene region of the "Raji" was sequnced (see Figure 
4) and discovered to have substantial sequence identity at both 
the nucleic acid and amino acid levels (98.9% and 96.3% 
respectively) with the PoEV sequence from PK-15. 

Example 10 

Phvlogenetic analysis 

Phylogenetic analysis was performed using the PHYLIP package. 
Sequence distances were calculated using the PROTDIST program 
(Dayhoff matrix) and a neighbour- joining unrooted phylogenetic 
tree reconstructed using the NEIGHBOUR program. 

Bootstrapping was performed using 200 replicates of the pol 
alignment, created using the SEQBOOT program and a consensus tree 
was obtained using the CONSENSE program (see Figure J) . The 
bootstrap percentages are indicated at the branch fork, with 
missing values equal to 100%. The data indicate that PoEV is 
closely related to but distinct from the type-C oncovirus 
typified by gibbon, murine and feline leukaemia viruses. 
A phylogenetic tree was constructed from the pol alignment using 
the maximum likliehood algorithm (Dayhoff matrix). This tree 
differed from the pol NJ tree only in the placement of the BaEV 
lineage in relation to other mammalian type C viruses and 
corresponded to a low bootstrap for the BaEV fork observed in the 
NJ tree. 
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Fvam ple 11 

ana lysis of the t.tr and adjacent region 

Th e long terminal repeat (LTR) is a reiterated sequence at each 
end of the provirus that contains the enhancer and promoter 
governing transcription of the provirus as well as sequences 
required for reverse transcription of the RKA genome and 
integration of the proviral DNA. Three recognised domains of the 
LTR are identifiable, U3 ( R and U5 with the LTR being delineated 
by inverse repeats AATGAAAGG and CCTTTCATT at the 5' and 3' ends 
of U3 and U5 respectively. 

LTR nomain PoEV_Gen^^^ Length^p. 

in accordance with Figure 3 



U3 7638-8106 

R * 8107-8188, 1-61 

U5 62 ' 143 



469 
82 
82 



•The position of the R is defined here by similarity to the 

of the MuLV LTR and is compatible with the observed location of 

a cap site approximately 24 bp downstream of the TATA box. 

The U3 region contans multiple potential transcription sites as 
shown in Figure 6. Host of the U3 region shows little or no 
homology to other mammalian type-C retroviruses which show 
conserved sites or repeat elements. However, there is homology 
« other mammaliann type-C viruses towards the , • end of the U3 

» region and into R and us. Amongst the potential transcription 

factor sites are those for the following: 
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LyF-1 is a transcriptional regulator that interacts with a novel 
class of promoters for lymphocyte-specific-genes (Lo et al 1991) . 

E47 is the prototype member of a new family of tissue specific 
enhancer proteins that have been shown to bind to the enhancer 
of murine leukaemia virus, 

ETS-1 is a transcription factor primarily expressed in the 
haematopoietic lineage . 

The LTR contains direct repeats at 80006-8062 and 8045-8101 which 
together contain three potential CCAATT boxes. A potential TATA 
box is located at position 8129-8144. 

The R region contains a PADS (Poly A downstream element) and 
consensus polyadeny lation signal (AATAAA) . 

The primer binding site (PBS) of PoEV is glycine(2) tRNA which 
has not reported for any exogenous retrovirus. 
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Claims 

1. An isolated polynucleotide fragment: 

(a) encoding at least one porcine retrovirus (PoEV) 
expression product; 

(b) encoding a physiologically active and/or immunogenic 
derivative of said expression product; or 

(c) which is complementary to a polynucleotide sequence as 
defined in (a) or (b) • 

2. An isolated polynucleotide fragment according to claim 1: 

(a) encoding the polymerase (POL) polypeptide; 

(b) encoding a physiologically active and/or immunogenic 
derivative of a polypeptide as described in (a) ; or 

(c) which is complementary to a polynucleotide sequence as 
defined in (a) or (b) . 

3. An isolated polynucleotide fragment according to claim 1: 

(a) encoding the virion core polypeptide (GAG) and/or 
envelope polypeptide (ENV) ; 

(b) encoding a physiologically active and/or immunogenic 
derivative of a polypeptide as described in (a) ; or 

(c) which is complementary to a polynucleotide sequence as 
defined in (a) or (b) . 
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An isolated polynucleotide fragment according to claim 1: 

(a) encoding the virion core polypeptide (GAG) , polymerase 
(POL) and envelope polypeptide (ENV) ; 

(b) encoding a physiologically active and/or immunogenic 
derivative of a polypeptide as described in (a) ; or 

(c) which is complementary to a polynucleotide sequence as 
defined in (a) or (b) . 

An isolated polynucleotide fragment according to any one of 
claims 1 to 4 wherein the polynucleotide fragments is a 
deoxyribose nucleic acid (DNA) fragment. 

6. An isolated polynucleotide fragment according to any 
preceding claim encoding: 

(a) said at least one polypeptide having an amino acid 
sequence which is shown in Figures 3 or 4 ; 

(b) encoding a polypeptide which is a physiologically 
active and/or immunogenic derivative of at least one 
of the polypeptides defined in (a); or 

(c) which is complementary to a polynucleotide sequence as 
defined in (a) or (b) . 

7. An isolated polynucleotide fragment according to any 
preceding claim; 

(a) comprising at least one of the ORFs shown in Figures 
1,2,3 or 4 or comprising a corresponding RNA sequence; 
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(b) comprising a sequence having substantial nucleotide 
sequence identity with a sequence as described in (a) 
above; or 

(c) comprising a sequence which is complementary to a 
sequence as described in (a) or (b) above. 

8. A recombinant nucleic acid molecule comprising a 
polynucleotide fragment according to any one of claims 1 to 
7. 

9. A recombinant nucleic acid molecule according to claim 8 
wherein the recombinant nucleic acid molecule comprises 
regulatory control sequences operably linked to said 
polynucleotide fragment for controlling expression of said 
polynucleotide fragment - 

10. A vector comprising a recombinant nucleic acid molecule 
according to either of claims 8 or 9 . 

11. A vector according to claim 10 which is a virus or a 
plasmid . 



12. A prokaryotic or eukaryotic host cell transformed by a 
polynucleotide fragment, recombinant nucleic acid molecule, 
or vector according to any of claims 1 to 11. 
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13 . A recombinant PoEV polypeptide or derivative thereof 
displaying POL PoEV physiological and/or immunogenic 
activity. 

14 . A recombinant PoEV polypeptide or derivative thereof 
displaying GAG and/or ENV PoEV physiological and/or 
immunogenic activity. 



15. 



16 



A recombinant PoEV polypeptide or derivative thereof 
displaying GAG, POL and ENV PoEV physiological and/or 
immunogenic activity . 

A recombinant PoEV polypeptide according to any one of 
claims 13 to 15 comprising a sequence as shown in Figures 
3 or 4, or functionally active derivative thereof. 

17 . A vaccine comprising a recombinant PoEV polypeptide 
according to any one of claims 13 to 16, or an inactivated 
POEV virus and a pharmaceutical^ acceptable carrier. 



18 . 



An antibody or fragment thereof capable of binding to a 
polypeptide or fragment according to any one of claims 13 
or 16 • 



19. A polynucleotide primer which is PoEV specific. 

20. A polynucleotide probe which is capable of specifically 
hybridising to a PoEV polynucleotide sequence. 
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21. A probe or a primer according to claims 19 or 20 which has 
substantial nucleotide sequence identity with a strand of 
the molecule depicted in Figures 1,2,3 or 4 or a strand 
complementary therewith, with a corresponding RNA molecule, 
or with a part of such a molecule. 



22. A PoEV detection kit comprising a polynucleotide primer or 
probe according to any of claims 19 to 21. 

23. Use of a PoEV specific polynucleotide in the detection of 
PoEV in a sample. 

24. Use of a PoEV specific polynucleotide in a PCR for the 
detection of PoEV in a sample. 

25. A pig modified so as to not express an infectious PoEV 
capable of infecting humans. 

26. Cells, tissues or organs obtainable from a pig accoding to 
claim 25. 

27. Use of a recombinant PoEV polypeptide according to any one 
of claims 13 to 16 in the preparation of a vaccine. 

28. Use of a polynucleotide primer or probe according to any of 
claims 19 to 21 in the preparation of a detection kit 
capable of detecting the presence of PoEV nucleic acid in 
a sample. 
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25 „se of a polynucleotide; polypeptide; ceils, tissues or 
organs according to any one of claims 1 to 7 . 11 to 16 or 
26 in therapy or diagnosis. 

30 A polynucleotide; polypeptide; cells, tissues or organs 

according to any one of claims l to 7, 11 to 16 or 2 6 in 

the preparation of a medicament for use in therapy or 
diagnosis . 



31 



,nhc^ntiallv as hereinbefore described. 
The invention substantially i*« 
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1 GAATTCGCGGCCGCGTCGACAGATGCCTTCTTCTGCCTGAGATTACACCCCACTAGCCAA 60 

6 1 CCACTTTTTGCCTTCGAATGGAGAGATCCAGGTACGGGAAGAACCGGGCAGCTCACCTGG 12 0 

121 ACCCGACTGCCCCAAGGGTTCAAGAACTCCCCGACCATCTTTGACGAAGCCCTACACAGG 18 0 

• ■ • • • * 

181 GACCTGGCCAACTTCAGGATCCAACACCCTCAGGTGACCCTCCTCCAGTACGTGGATGAC 2 4 0 

2 41 CT GCTTCTGGC GGGAGCC ACCAAACAG GACT GCTT AGAAGGT AC GAAGGCACT ACT GCT G 3 0 0 

301 GAATTGTCTGACCTAGGCTACAGAGCCTCTGCTAAGAAGGCCCAGATTTGCAGGAGAGAG 3 60 

361 GTAACATACTTGGGGTACAGTTTGCGGGGCGGGCAGCGATGGCTGACGGAGGCACGGAAG 42 0 

421 AAAACT GTAGTCCAGATACCGGCCCCAACCACAGCCAAACAAGTGAGAGAGTTTTTGGGG 4 3 0 

4 81 ACAGCTGGATTTTGCAGACTGTGGATCCCGGGGTTTGCGACCTTAGCAGCCCCACTCTAC 5 4 0 

541 CCGCTAACCAAAGAAAAAGGGGGATTCTCCTGGGCTCCTGAGCACCAGAAGGCATTTGAT 600 

601 GCTATCAAAAAGGCCCTGCTGAGCGCACCTGC7CTGGCCCTCCCTGACGTAACTAAACCC 660 

6 61 TTTACCCTTTATGTGGATGAGCGTAAGGGAGTAGCCCGAGGAGTTTTAACCCAAACCCTA 7 2 0 
721 GGACCATGGAGGAGACCTGTTGCCTACCTGTCAAAGAAGCTTGATCCTGTAGCCAGTGGT 78 0 

7 81 TGGCCCGTATGTCTGAAGGCTA7CGCAGCTG7GGCCATACTGGTCAAGGACGCTGACAAA 8 4 0 
341 TTGACTTTGGGACAGAATATAACTGTAATAGCCCCCCATGCATTGGAGAACATCGTTCGG 90 0 
901 CAGCCCCCAGACCGATGGATGACCAACGCCCGCATGACCCACTATCAAAGCC7GCTTCTC 9 60 

9 6 1 ACAGAGAGGGTCACTTTCGCTCCACCAGCCGCTCTCAACCCTGCCACTCTTCTGCCTGAA 132 0 
102 1 GAGACTGATGAACCAGTGACTCATGATTGCCATCAACTATTGATTGAGGAGACTGGGGTC 10 8 0 
108 1 CGCAAGGACCTTACAGACATACCGCTGACTGGAGAAGTGCTAACCTGGTTCACTGACGGA 114 0 
114 1 AGCAGCTA7G7GG7GGAAGG7AAGAGGA7GGCTGGGGCGGCAG7GGTGGACGGGACCCGC 12 0 J 
12 01 ACGA7C7GGGCCAGCAGCC7GCCGGAAGGAACTTCAGCGCAAAAGGCTGAGCTCA7GGCC 12 60 
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126 ; ic.CO^CTrrCCOOCT^CC^^TCCAT^C.TTT^CO^CC^ 
1U1 TRT 3CCTTT0C M CT^«c™CAC^CC,TCT^C^==aTT.CTTACC 

M4l TTG CC^ M CT^«T^CACT G i< : CT 5 ««Ti«».C=«^TCT«T* ISOO 
150l TCTACA^C^^CTGACCCGCTT^C^AaicACCCCA^CTCTT^CCTT 

u.i ctscctataatagaaacgcgc^gcgcca^ 

!«1 TOOCAAOAOATAAV^OATA^CC^TTCTCT^crCCO^^CACCTOCTMACC 

lM1 tcatatgggaaggaaatcctgccccacaaagaagggttagaatatgtccaacagatacat I7«. 
„„ ggtgtaagcgacgtaggaagtaaacagctggagcagttggtcagaagatccccttatgat 
lt .l gttctgagggtaccaggagtggctgactcggtggtgaaacattgtgtgcgctgccagctg 
1M i gttaatgctaatgcttggagaatacctgcaggaaagagactaaggggaagcgacccaggc 
1921 gctcactgggaagtggacttcactgaggtaaagccggctaaatacggaaacaaatatcta 
1SS1 ttggtttttgtagacaccttttcaggatgggtagaggcttatcgtactaagaaagagact 
,m tcaaccgtggtggctaagaaaatactggaggaaatttttccaagatttggaatacctaag 

2161 atattggggattgattggaaactgcattgtgcatacagaccccaaagctcaggacaggta 2Z20 
2221 gagaggatgaatagaaccattaaagagacccttaccaaattgaccacagagactggcatt 22 bo 
228 , aatgattggatggctctcctgccctttgtgctttttagggtgaggaacacccctggacag 3 3«o 

„4 , TTTGGGCTGACCCCCTATGAATTGCTCTACGGGGGACCCCCCCCGTTGGCAGAAATTGCC 
„ 0 1 TTTGCACATAGTGCTGATGTGCTGCTTTCCCAGCCTTTGTTCTCTAGGCTCAAGGCGCTC 
2(61 GAGTGGGTGAGGCAGCGAGCGTGGAAGCAGCTCCGGGAGGCCTACTCAGGAGGAGACTTG 2,20 
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2 521 CAAGTTCCACATCGCTTCCAAGTTGGAGATTCAGTCTATGTTAGACGCCACCGTGCAGGA 2 5 8 0 

258 1 AACCTCGAGACTCGGTGGAAGGGACCTTATCTCGTACTTTTGACCACACCAACGGCTGTG 2 64 0 

2 641 AAAGTCGAAGGAATCCCCACCTGGATCCATGCATCCCACGTTAAGCCGGCGCCACCTCCC 27 0 0 

27 01 GATTCGGGGTGGAAAGCCGAAAAGACTGAAAATCCCCTTAAGCTTCGCCTCCATCGCGTG 27 60 

27 61 GTTCCTTACTCTGTCAATAACTCCTCAAGTTAATGGTAAACGCCTTGTGGACAGCCCGAA 2 62 0 

2 8 21 CTCCCATAAACCCTTATCTCTCACCTGGTTACTTACTGACTCCGGTACAGGTATTAATAT 2 8 8 0 

2 8 81 TAACAGCACTCAAGGGGAGGCTCCCTTGGGGACCTGGTGGCCTGAATTATATGTCTGCCT 2 94 0 

2 94 1 TCGATCAGTAATCCCTGGTCTCAATGACCAGGCCACACCCCCCGATGTACTCCGTGCTTA 3 00 0 

3001 CGGGTTTTACGTTTGCCCAGGACCCCCAAATAATGAAGAATATTGTGGAAATCCTCAGGA 3 06 0 

30 61 TTTCCTTTGCAAGCAATGGAGCTGCATAACTTCTAATGATGGGAATTGGAAATGGCCAGT 312 0 

3121 CTCTCAGCAAGACAGAGTAAGTTACTCTTTTGTTAACAATCCTACCAGTTATAATCAATT 318 0 

3181 TAATTATGGCCATGGGAGATGGAAAGATTGGCAACAGCGGGTACAAAAAGATGTACGAAA 32 4 0 

32 4 1 TAAGCAAATAAGCTGTCATTCGTTAGACCTAGATTACTTAAAAATAAGTTTCACTAAAAA 3 3 00 

3301 AAAAAAAAAAAAAAAAAAAA 3320 
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1 TGTGGGCCCCAGCGCGCTTGGAATAAAAATCCTCTTGCTGTTTGCATCAAGACCGCTTCT 60 
61 CGTGAGTGATTTGGGGTGTCGCCTCTTCCGAGCCCGGACGAGGGGGATTGTTCTTTTACT 120 
121 GGCCTTTCATTTGGTGCGTTGGCCGGGAAATCCTGCGACCACCCCTTACACCCGAGAACC 1.0 
181 GACTTGGAGGTAAAGGGATCCCCTTTGGAACGTGTGTGTGTGTCGGCCGGCGTCTCTGTT 240 
241 CTGAGTGTCTGTTTTCGGTGATGCGCGCTTTCGGTTTGCAGCTGTCCTCTCAGACCGTAA 300 
301 GGACTGGAGGACTGTGATCAGCAGACGTGCTAGGAGGATCACAGGCTGCCACCCTGGGGG 360 
361 ACGCCCCGGGAGGTGGGGAGAGCCAGGGACGCCTGGTGGTCTCCTACTGTCGGTCAGAGG 420 
421 ACCGAGTTCTGTTGTTGAAGCGAAAGCTTCCCCCTCCGCGGCCGTCCGACTCTTTTGCCT 4 8 0 
481 GCTTGTGGAAGACGCGGACGGGTCGCGTGTGTCTGGATCTGTTGGTTTCTGTCTCGTGTG ,40 
541 TCTTTGTCTTGTGCGTCCTTGTCTACAGTTTTAATATGGGACAGACAGTGACTACCCCCC 600 
601 TTAGTTTGACTCTCGACCATTGGACTGAAGTTAGATCCAGGGCTCATAATTTGTCAGTTC 660 
661 AGGTTAAGAAGGGACCTTGGCAGACTTTCT GTGCCTCTGAATGGCCAACATTCGATGTTG 720 
721 GATGGCCATCAGAGGGGACCTTTAATTCTGAAATTATCCTGGCTGTTAAGGCAATCATTT 780 
781 TTCAGACTGGACCCGGCTCTCATCCTGATCAGGAGCCCTATATCCTTACGTGGCAAGATT 840 
841 TGGCAGAAGATCCTCCGCCATGGGTTAAACCATGGCTAAATAAACCAAGAAAGCCAGGTC 900 
901 CCCGAATCCTGGCTCTTGGAGAGAAAAACAAACACTCGGCCGAAAAAGTCGAGCCCTCTT 960 
961 CCTCGTATCTACCCCGAGATCGAGGAGCCGCCGACTTGGCCGGAACCCCAACCTGTTCCC 1020 
1021 CCACCCCCTTATCCAGCACAGGGTGCTGTGAGGGGACCTCTGCCCCTCCTGGAGCTCCGG 
10 81 TGGTGGAGGGACCTGCTGCCGGGACTCGGAGCCGGAGAGGCGCCACCCCGGAGCGGACAG 114 0 
1141 ACGAGATCGCGATATTACCGCTGCGCACCTATGGCCCTCCCATGCCAGGGGGCCAATTGC 1200 
1201 AGCCGCTCCAGTATTGGCCCTTTTCTTCTGCAGATCTCTATAATTGGAAAACTAACCATC 1260 
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12 61 CCCCTTTCTCGGAGGATCCCCAACGCCTCACGGGGTTGGTGGAGTCCCTTATGTTCTCTC 132 0 
1321 ACCAGCCTACTTGGGATGATTGTCAACAGCTGCTGCAGACACTCTTCACAACCGAGGAGC 13 8 0 

13 81 GAGAGAGAATT CTGTTAGAG G CT AG AAAAAAT GTT C CT GGG G C C G AC GG G C G AC C C AC G C 14 4 0 

14 41 AGTTGCAAAATGAGATTGACATGGGATTTCCCTTGACTCGCCCCGGTTGGGACTACAACA 1500 

• • • • . 

1501 CGGCTGAAGGTAGGGAGAGCTTGAAAATCTATCGCCAGGCTCTGGTGGCGGGTCTCCGGG 15 60 

15 61 GCGCCTCAAGACGGCCCACTAATTTGGCTAAGGTAAGAGAGGTGATGCAGGGACCGAACG 162 0 
1621 AACCTCCCTCGGTATTTCTTGAGAGGCTCATGGAAGCCTTCAGGCGGTTCACCCCTTTTG 16 8 0 

16 81 ATCCTACCTCAGAGGCCCAGAAAGCCTCAGTGGCCCTGGCCTTCATTGGGCAGTCGGCTC 17 4 0 

17 4 1 TGGATATCAGGAAGAAACTTCAGAGACTGGAAGGGTTACAGGAGGCTGAGTTACGTGATC 18 00 

1 6 0 1 T AGT GAGAG AG GC AGAGAAG GT GT ATT ACAG AAG G G A G AC AG AAG AGG AG AAGGAAC AGA 18 60 

18 61 GAAAAGAAAAGGAGAGAGAAGAAAGGGAGGAAAGACGTGATAGACGGCAAGAGAAGAATT 192 0 
1921 TGACTAAGATCTTGGCCGCAGTGGTTGAAGGGAAGAGCAGCAGGGAGAGAGAGAGAGATT 198 0 

19 81 TTAGGAAAATTAGGT C AGG CCCTAGACAGTCAGGG AACCT GGGC AATAGGAC C C C ACTC G 2 0 4 0 

2 04 1 ACAAG GACC AGT GT G C GT ATT GT AAAGAAAAAG GAC AC T G G G C AAG G AACT G C C C C AAGA 2 100 
2101 AGGGAAACAAAGGACCGAAGTCCTAGCTCTAGAAGAAGATAAAGATTAGGGGAGACGGGT 216 0 
2161 TCGGACCCCCTCCCCGAGCCCAGGGTAACTTTGAAGGTGGAGGGGCAACCAGTTGAGTTC 222 0 
2221 CTGGTTGATACCGGAGCGGAGCATTCAGTGCTGCTACAACCATTAGGAAAACTAAAAGAA 22 8 0 
2 2 61 AAAAAAT CCTGGGTGATGGGTGCC AC AGGGCAACGGC AGT AT CCATGGACTACCCGAAGA 2 3 4 0 
2 3 4 1 ACCGTTGACTTGGGAGTGGGACGGGTAACCCACTCGTTTCTGGTCATCCCTGAGTGCCCA 2 4 0 0 
2 4 01 GTACCCCTTCTAGGTAGAGACTTACTGACCAAGATGGGAGCTCAAATTTCTTTTGAACAA 2 4 6C 
2 4 61 GGAAGACCAGAAGTGTCTGTGAATAACAAACCCATCACTGTGTTGACCC7CCAATTAGA7 2 52 0 
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252l ^^TC^TTCTCCC^^™^™^-^ ».. 
2581 ^TTC=CC«~^C=™™=CC^=««TTCCC 26 .0 

MT ^0==T=^™= S =CC=.T OT ««™C™OC,TC »«, 
2261 OTOT CCOTCC«T=CCCT T O^CTCCCCT=C»CCOOTT^=CCT0O M CC 

2I21 MTGAIT „CO,CC.=T^0«T TO ^=TC«T^OCOTO«CC.C«^C 2... 

28 » CC A CC ST COC^CO™CCTC T TO,=CO==CTCCC 0 CCO«C=^C T =OT.C 2*<° 

„„ .™= W ct™tccct T =t T ctocc TG ,o~ccc.c T ™c, 3000 

3 „„ 3 CTmT OCCTTCO»X0~OC^CO00^CCOO0COO T «CCTC W CC 30,0 

306l c^ococc^o^c^ctcccc^c^c^occct^c^c 3 12 o 

312l CTOOC^^CC^CCCT^O.OCCTCC.CC^CCTOO.TO^C 3300 
, lM CTT „OOCO== M C«CC^O»CTOC m o«OOTACO«=0^CTCCT M », 3 2 <0 
32)1 TT 0TCTO,CCT.OO™C^=CCTCTOCT^OCCC^ T TOC«=^0^ 3300 
3331 ^CTTOOOOT^TCCOOOOCCOCC^TOOCTO.CCO.OOCCO™ 3.00 
336I ,™c»™OCCCC«CC.C M CC™T0 M ^=TT m OOO^ 3..0 
MJl OT =0^ T TOC^CT=TO0,TC=C==O= T TtC00.CC TO O=«=C=C.CTCT,CCC= 3<S0 
3<81 CT „=C^ S ^=O=O=,TT=TCCT0O=C,CCT W 0C.C™c A TTT^=CT 3S.0 
35 „ ^c^OOCCCTOCTOACOOCOCTOCTCTOCCCCCCCTCO^CT^CCCTTT 3000 
3601 .CCCTr^TCTCOArOAOCOT^O^OCCCCOOAOTTT^CCC^CCCT^ 3600 
3661 CC«OO^CCTOT,=CCTACCTOTO^OCTTO,TCCT=TAOC«0,0=TTOO 33.0 
3221 C CCOT«0,=TO«=OC^TCO=,0=T= T 0=CO,T,=TO=T=«0«COCTO,C^T 2 0 33S0 
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37 81 ACTTTGGGACAGAATATAACTGTAATAGCCCCCCATGCATTGGAGAACATCGTTCGGCAG 3 8 4 0 

38 41 CCCCCAGACCGATGGATGACCAACGCCCGCATGACCCACTATCAAAGCCTGCTTCTCACA 3 900 
3901 GAGAGGGTCACTTTCGCTCCACCAGCCGCTCTCAACCCTGCCACTCTTCTGCCTGAAGAG 3 960 

3961 ACT GAT GAAC CAGTGACT CAT GAT T G C CAT CAACTATT GATT GAGGAG ACT G G GGT C C G C 4 02 0 

» • • 

4 02 L AAGGACCTTACAGACATACCGCTGACTGGAGAAGTGCTAACCTGGTTCACTGACGGAAGC 4 08 0 

4 081 AGCTATGTGGTGGAAGGTAAGAGGATGGCTGGGGCGGCAGTGGTGGACGGGACCCGCACG 414 0 

4141 ATCTGGGCCAGCAGCCTGCCGGAAGGAACTTCAGCGCAAAAGGCTGAGCTCATGGCCCTC 4 2 0 0 

42 01 ACGCAAGCTTTGCGGCTGGCCGAAGGGAAATCCATAAACATTTATACGGACAGCAGGTAT 4 2 60 

4 2 61 GCCTTTGCGACTGCACACGTACACGGGGCCATCTATAAACAAAGGGGGTTGCTTACCTCA 4 32 0 

4 321 GCAGGGAGGGAAATAAAGAACAAAGAGGAAATTCTAAGCCTATTAGAAGCCTTACATTTG 4 3 8 0 

4 38 1 CCAAAAAGGCTAGCTATTATACACTGTCCTGGACATCAGAAAGCCAAAGATCTCATATCT 4 4 4 0 

44 4 1 AGAGGGAACCAGATGGCTGACCGGGTTGCCAAGCAGGCAGCCCAGGCTGTTAACCTTCTG 4 5 00 

4501 CCTATAATAGAAACGCCCAAAGCCCCAGAACCCAGACGACAGTACACCCTAGAAGACTGG 4 560 

4 561 CAAGAGATAAAAAAGATAGACCAGTTCTCTGAGACTCCGGAGGGGACCTGCTATACCTCA 4 62 0 

4 62 1 TATGGGAAGGAAATCCTGCCCCACAAAGAAGGG7TAGAATATGTCCAACAGATACATCGT 4 68 0 

4 681 CTAACCCACCTAGGAACTAAACACCTGCAGCAGTTGGTCAGAACATCCCCTTATCATGTT 4 7 4 0 

4741 CTGAGGC7ACCAGGAGTGGCTGACTCGGTGG7CAAACA7TGTGTGCCCTGCCAGCTGGTT 4 800 

4 801 AATGCTAATCC7TCCAGAATACCTCCAGGAAAGAGACTAAGGGGAAGCCACCCAGGCGCT 4S6C 

4 8 61 CACTGGGAAGTGGACTTCACTGAGGTAAAGCCGGCTAAATACGGAAACAAATATCTATTG 4 92 C 

4 921 GTTTTTGTAGACACCTTTTCAGGATGGGTAGAGGCTTATCCTACTAAGAAAGAGACTTCA 4 98 0 

4 96 1 ACCGTGGTGGC7AAGAAAATACTGGAGGAAA7777TCCAAGATTTGGAA7ACCTAAGG7A 504 0 
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5041 ATAGGGTCAGACAATGGTCCAGCTTTCGTTGCCCAGGTAAGTCAGGGACTGGCCAAGATA 5 1O0 

5101 TTGGGGATTGATTGGAAACTGCATTGTGCATACAGACCCCAAAGCTCAGGACAGGTAGAG 5160 

5161 AG GAT G AAT AGAAC C ATT AAAG AGAC C CTTACCAAATT G AC CACAG AGACT G G C ATT AAT 5220 

5221 GATTGGATGGCTCTCCTGCCCTTTGTGCTTTTTAGGGTGAGGAACACCCCTGGACAGTTT 52 8 0 

5281 GGGCTGACCCCCTATGAATTGCTCTACGGGGGACCCCCCCCGTTGGCAGAAATTGCCTTT 534 0 

5341 GCACATAGTGCTGATGTGCTGCTTTCCCAGCCTTTGTTCTCTAGGCTCAAGGCGCTCGAG 54 00 

5401 TGGGTGAGGCAGCGAGCGTGGAAGCAGCTCCGGGAGGCCTACT CAGGAGGAGACTTGCA^i 54 60 

54 61 ' GTTCCACATCGCTTCCAAGTTGGAGATTCAGTCTATGTTAGACGCCACCGTGCAGGAA^C 552 0 

5521 CTCGAGACTCGGTGGAAGGGACCTTATCTCGTACTTTTGACCACACCAACGGCTGTGAAA 5580 

5581 GTCGAAGGAATCCCCACCTGGATCCATGCATCCCACGTTAAGCYGGCGCCACCTCCCGAC 564 0 

5641 TCGGGGTGGAGAGCCGAAAAGAcTGAGAATCCCCTTAAGCTTCGCCTCCATCGCCTGGTT 57 0 0 

5701 CCTTACTCTAACAATAACTCCCCAGGCCAGTAGTAAACGCCTTATAGACAGCTCGAACCC 5760 

57 61 CCATAGACCTTTATCGCTTACCTGGCTGATTATTGACCCTGATACGGGTGTCACTGTAAA 5 82 0 

5821 TAGCACTCGAGG7GTTGCTCCTAGAGGCACCTGGTGGCCTGAACTGCATTTCTGCCTCCG 583 0 

5 8 81 ATTGATTAACCCCGCTGTTAARAGCACACCTCCCAAC-TAGTCCGTAGTTATGGGTTCTA 5 9 4 G 

5941 TTGCTGCCCAGGCACAGAGAAAGAGAAATACTGTGGGGGTTCTGGGGAATCCTTCTG7AG 6000 

6001 GAGATGGAGCTGCGTCACCTCCAACGA7GGAGACTGGAAATGGCCGATCTCTCTCCAGGA 6060 

60 61 CCGGGTAAAATTCTCCTTTGTCAATTCCGGCCCGGGCAAGTACAAAATGATGAAACTATA 6 12C 

6121 TAAAGATAAGAGC7GCTCCCXATCAGACTTAGATTATCTAAAGA7AAGTTTCACTGAAAG 6 1 8 C 

618 1 GAAAACAGGAAAA7ATTCAAAAGTGGATAAATGGTATGAGCTGGGGAATAGTTTTTTAT7 62 4 0 
62 4 1 ATATGGCGGGGGAGCAGGGTCCACTTTAACCATTCGCCTTAGGATAGAGACGGGGACAGA 6 3 0 0 
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6301 


ACCCCCTGTGGCAATGGGACCCGATAAAGTACTGGCTGAACAGGGGCCCCCGGCCCTGGA 


"Ten 


6361 


GCCACCGCATAACTTGCCGGTGCCCCAATTAACCTCGCTGCGGCCTGACATAACACAGCC 


O *4 ^ U 


6421 


GCCTAGCAACAGTACCACTGC3ATTGATTCCTACCAACACGCCTAGAAAC-CCCCAGGTGT 


C>) O A 

DSOU 


6461 


TCCTGTTAAGACAGGACAGAGACTCTTCAGTCTCATCCAGGGAGCTTTCCAAGCCATrAA 


654 0 


6541 


CTCCACCGACCCTGATGCCACTTCTTCTTGTTGGCTTTGTCTATCCTCAGGGrrTrrTTn 


6600 


6601 


TTATGAGGGGATGGCTAAAGAAAGAJsj\ATTCAATGTGACCAAAGAGt*^AT Ar A atTratrr 


6660 


6661 


^ACATGGGGGTCCCGAAATAAGCTTACCCTCAC^GAAGT^~CCGGGAAr "r* r - Tr-z 


672 0 


6721 


AGGAAAAGCTCCCCCATCCCACCAACACC7TTGCTATAGTArTrTr-TT^iLTrh--«r^^ 

^ r ^ vv -"^ ***■ ** » u*- i AiAiji M 1 ^- iijibol 1 I ATGAGCAGGC 


6780 


6761 


CTCAGAAAATCAGTATTTAGTACCTGGTTATAACAGf;Tr;rTrrrr2iTr~/~a a T ArTrrrw 


6840 


6841 


AACCCCCTGTGTTTCCACCTCAGTCTTCAACCAATCrAAArixT^rTr-r-rnTrr^n^^ 

^AncLnAi ^l.m>\m.vj/\ i ; i CTG id CATGGTCCA 


6900 


6901 


AATCGTCCCCCGAGTGTACTACCATCCTGAGGAAGTGGTCCTTGATrAA"-Trar-r.tTr- 


6960 


6961 


GTATAACCGACCAAAAAGAGAACCCGTATCCCTTACCCTAGC^GTAATGr^^ 1 — a--rarr 


7 02 0 


7021 


GACGGCCGTTGGCGTAGGAACAGGGACAGCTGCCCTGATCACAGGACCACA.Gr AT' — "ara 


7 0 8 0 


70B1 


GAAAGGACTTGGTGAGCTACATGCGGCCATGACAGAAGATC^CCGAr,rr— aaarrar^r 


7 14 0 


7141 


TGTTAGCAACCTAGAAGAGTCCCTGACTTCTTTGTCTGAAGTGGTTCT ACAGAACCr;r : r 


*7 T A A 

/ 2 00 


7201 


GGGATTAGATCTGCTGTTTCTAAGAGAAGGTGGGTTATGTGCAGCCTTAAAAGAAGAA-G 


*7 ~> a n 


7261 


TTGCTTCTATGTAGATCACTCAGGAGCCATCAGAGACTCCATGAACAAGCT^AGAAAAAA 


i o 1 r\ 
/ J Z U 


7321 


GTTAGAGAGGCGTCGAAGGGAAAGAGAGGCTGACCAGGGGTGGTTTGAAGGATGG-TCAA 


/ J o U 


7381 


CAGGTCTCCTTGGATGACCACCCTGCTTTCTGCTCT GACGGGGCCCCTAGTAGTCC7GCT 


74 40 


7 4 4 1 


CCTGTTACTTACAGTTGGGCCTTGCTTAATTAATAGGTTTGTTGCCTTTGTTAGAGAACG 


7500 


7 501 


AGTGAGTGCAGTCCAGATCATGGTACTTAGGCAACAGTACCAAGGCCTTCTGAGCCAAGG 
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"7 561 AGAAACTGACCTCTAGCCTTCCCAGTTCTAAGATTAGAACTATTAACAAGACAAGAAGTG 7 62 0 

7 621 GGGAATGAAAGGATGAAAATGCAACCTAACCCTCCCAGAACCCAGGAAGTTAATAAAAAG 7 680 

7 681 CTCTAAATGCCCCCGAATTMCAGACCCTGCTGGCTGCCAGTAAATAGGTAGAAGGTCACA 774 0 

7741 CTTCCTATTGTTCCAGGGCCTGCTATCCTGGCCTAAGTAAGATAACAGGAAATGAGTTGA 7 8 00 

7 B 0 1 CTAATCGCTTATCTGGATTCTGTAAAACTGACTGGCACCATAGAAGAATTGATTACACAT 7 8 60 

7 661 TGACAGCCCTAGTGACCTATCTCAACTGCAATCTGTCACTCTGCCCAGGAGCCCACGCAG 7 92 0 

7 521 ATGCGGACCTCCGGAGCTATT7TAAAATGA7TGGTCCACGGAGCGCGGGCTCTC3ATATT 7 9 3 0 

7 S 6 1 7TAAAATGATTGGTCCATGGAGCGCGGGCTC7CGATATTTTAAAATGATTGGTTTGTGAC 8 0 4 0 

8 041 GCACAGGCTTTGTTGTGAACCCCATAAAAGCTGTCCCGATTCCGCACTCGGGGCCGCAG7 8100 
8101 CCTCTACCCCTGCGTGGTGTACGACTGTGGGCCCCAGCGCGCTTGGAATAAAAATCCTCT 8160 
8161 T3CTGTTTGCATCAAAAAAAAAAAAAAAAAAAAAAA 9196 
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1 GTGGTGTACGACTGTGGGCCCCAGCGCGCTTGGAATAAAAATCCTCTTGCTGTTTGCATC 60 

61 AAGACCGCTTCTCGTGAGTGATTTGGGGTGTCGCCTCTTCCGAGCCCGGACGAGGGGGAT 12 0 

121 TGTTCTTTTACTGGCCTTTCATTTGGTGCGTTGGCCGGGAAATCCTGCGACCACCCCTTA 180 

1 8 1 CACCCGAGAACCGACTTGGAGGTAAAGGGATCCCCTTTGGAACGTGTGTGTGTGTCGGCC 

241 GGCGTCTCTGTTCTGAGTGTCTGTTTTCGGTGATGCGCGCTTTCGGTTTGCAGCTGTCCT 

3 0 1 CTCAGACCGTAAGGACTGGAGGACTGTGATCAGCAGACGTGCTAGGAGGATCACAGGCTG 3 6 0 

3 6 1 CCACCCTGGGGGACGGCCCGGGAGGTGGGGAGAGCCAGGGACGCCTGGTGGTCTCCTACT 4 2 0 

421 GTCGGTCAGAGGACCGAGTTCTGTTGTTGAAGCGAAAGCTTCCCCCTCCGCGGCCGTCCG 4 8 0 

481 ACTCTTTTGCCTGCTTGTGGAAGACGCGGACGGGTCGCGTGTGTCTGGATCTGTTGGTTT 54 0 

54 1 CTGTCTCGTGTGTCTTTGTCTTGTGCGTCCTTGTCTACAGTTTTAATATGGGACAGACAG 600 

MetGlyGlr.ThrV 

60 1 T =ACTACCCCCCTTAGTTTGACTCTCGACCATTGGACTGAAGTTAGATCCAGGGC-CAT> 

7 " ™S -o 

781 AGGCAATCATTTTTCAGACTGGACCCGGCTCTCATCCTGATCAr.r. aG n^.-. A ^, ; 

^^^^I^^eGlnThrGlyProGlySerHisProAspGlnGlu^^y;?!^;,:? 



Il^o CC ; CTT:CTCGTATCTACCCCGAGATCGA GGAGCCGCCGACTTGGCCGGAAC" ■ 
alGi -- D "Se rS erS e rTyrLeuProArgA SP ArgGlyAlaAlaA S pL,uA^Gr^n:; 

y f^-eiioer.erihrSlyCysCysGiuGlyThrSerAiaPrcr- 
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»•* 11,0 

»« s=^r==== 1200 

1201 GGGGCCAATTGCAGCCCCTCCAGTATTGGCCCTTTTCTTCTGCAGATCTCTATAA.TTGGA 1260 
iyG?yG^IeuGlnProLeuGlnTyrTr P ProPheSerSerAlaA SP LeuTyrAsnTrpL 

1261 AAACTAACCATCCCCCTTTCTCGGAGGATCCCCAACGCCTCACGGGGTTGGTGGAGTCCC 1320 

1321 TTATGTTCTCTCACCAGCCTACTTGGGATGATTGTCA^CAGCTGCTGCAGACACTCTTCA 13 8 0 
1321 ™^2erHisGlnProThrTtpA.pA-pCy»GlnGlnL.uL.uGlnThrL«uPh.T 

nHl CAACCGAGGAGCGAGAGAGAATTCTGTTAGAGGCTAGAAAAAATGTTCCTGGGGCCGACG 144 0 
1 ^hrSSSSrgSSrglleLeuLeuGluAlaArgLysAsnValProGlyAlaAspG 

, 44-1 rrrr JxrcCACGCAGTTGCAAAATGAGATTGACATGGGATTTCCCTTGACTCGCCCCGGTT 1500 
1441 ? y ArgProT"hr^ 

1501 GGGACTACAACACGGCTGAAGGTAGGGAGAGCTTGAAAATCTATCGCCAGGCTCTGCT 1560 
cpAspTyrAsnThrAlaGluGlyArgGluSerLeuLysIleTyrArgGlnAlaLeuValA 

1561 CGGGTCTCCGGGGCGCCTCAAGACGGCCCACTAATTTGG^ ^20 
laGlyLeuArgGlyAlaSetArgArgProThrAsnLeuAlaLysValArgGluValMerG 

1621 AGGGACCGAACGAACCTCCCTCGGTATTTCTTGAGAGGCTCATGGAAGCCTTC 1680 
InGlyProAsnGluProProSerValPheLeuGluArgLeuMetGluAlaPheArgArgP 

1681 TCACCCCTTTTGATCCTACCTCAGAGGCCCAGAAAGCCTCAGTGGCCCTGGCCTTCAT7G 17 4 0 
he^rProPheLpProThrSerGluAlaGlnLysAlaSerValAlaLeuAlaPhelleG 

1741 GGCAGTCGGCTCTGGATATCAGGAAGAAACTTCAGAGACTGGAAGGGTTACAGGAGGCTG 1300 
lyGlnSerAlaLeuAspIleArgLysLysLeuGlnAtgLeuGluGlyLeuGlnGluAlaG 

1801 AGTTACGTGATCTAGTGAGAGAGGCAGAGAAGGTGTATTACAGAAGGGAGACAG i960 
luLeuAirgAspLeuValArgGloAlaGluLysValTyrTyrArgArgGluThrGluGluo 

18 61 AG AAGG AAC AG AGAAAAGAAAAGGAG AG AG AAGAAAGG GAGGAAAGACGT ^^^^^ ^ ^ 1 * 2 ° 
luLysGluGlnArgLysGluLysGluArgGluGluArgGluGluAr gArgAspAr gArgG 

i 9 51 AAGAGAAGAATTTGACTAAGATCTTGGCCGCAGTGGTTGAAGGGAAGAGCAGCAGGGAGA 1930 
TnGluLysAsnLeuThrLysIleLeuAlaAlaValValGl.GlylysSerSerArgGl^ 

158 1 GAGAGAGAGAT7TTAGGAAAATTAGGTCAGGCCCTAGACAGT ^^^^^^^y^^ = °* ° 
rgGluArgAspPheArgLysIleArgSerGlyProArgG-noerGiyAsnLeuGlyAsn.. 
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2041 y^^wpp^^^^^^^^^^^^^^^^^'^^^'^'^^^^^^^^GACACTGGGCAAGGA 210 0 
rgThrProLeuAspLysAspGlnCysAlaTyrCysLysGluLysClyHisTrpAlaArgA 

2101 ACTGCCCCAAGAAGGGAAACAAAGGACCGAAGgTCCTAGCTCTAGAAGAAGATAAAGATT 2 i a n 
snCy S ProLysLy S Gl y A 3 nL yS Gl y ProLy S ValL e uAlaLeuGluGluAVp^SpE 

2161 ^?S^A G ^ T I CG f VCCCCCTCCCCGAGCCCAGGG ^ CTTTG ^ GG TGGAGGGGC ?220 
ndGlyArgArgGlySerA^pProLeuProGIuProArgValThrLeuLysValGluGlyG 

2221 AACCAGTTGAGTTCCTGGTTGATACCGGAGCGGAGCATTCAGTGCTGCTACAACCATTAG ??ftn 
InProValGluPheLeuValA.pThrGlyAlaGluHisSerValJuSuG^Pr^IiG 

2281 GAAAACTAAAAGAAAAAAAATCCTGGGTGATGGGTGCCACAGGGCAACGGCAGTATCCA- 23 4 0 
lyLysLeuLysGluLysLysSerTrpValM.tGlyAlaThrGlyGlnArgGlnTyrProT 

2341 GGACTACCCGAAGAACCGTTGACTTGGGAGTGGGACGGGTAACCCACTCGTT7C-GGTCA -4 00 
^ThrThrArgArgThrValA S pi.euGlyValGl y ArgValThzHi 5 SerPh;i;^i °° 

2 4 01 TCCCTGAGTGCCCAGTACCCCTTCTAGGTAGAGACTTACTGACCAAGATGGGAGCTCAA 
leProGluCysProValProLeuLeuGlyArgAspLau^uThrLysMetGlyAiaGln 

2 4 61 777 C 7"777GAACAAGGAAGACCAGAAG7G7C7G7GAA7AACAAACCCA7CAC7G7G77GA 
leSecPheGluGlnGlyArgProGluValSerValA^nAsnLysProIleThrValLeu? 

2521 CCCTCCAATTAGATGATGAATATCGACTATATTCTCCCCAAGTAAAGCCTGATCAAGATA 2580 
hrLeuGlnLeuAspAspGluTyrArgLeuTyrSerProGlnValLysProAspGlnAipt 

25 81 TACAGTCCTGGTTGGAGCAGTTTCCCCAAGCCTGGGCAGAAACCGCAGGGATGGGTTTGG -64 0 
leGlnSerTrpLeuGluGlnPhePtaGlnAlaTrpAlaGluThrAlaGlyMetGlyLeuI 

2641 CAAAGCAAGTTCCCCCACAGGTTATTCAACTGAAGGCCAGTGCTACACCAGTAT CAG7CA 27 00 
la^GlnValProProGlnVallleGlnLeuLysAlaSerAlaThrProValS^rVaJi 

27 01 GACAGTACCCCTTGAGTAGAGAGGCTCGAGAA.GGAATTTGGCCGCATGT7CAAAGATTAA -7 60 
rgGlr.TyrProLeuSerArgGluAlaArgGluGlyXieTrpPr.H.sVa-G^gle^ 

2 7 61 TCCAACAGGGCATCCTAGTTCCTGTCCAATCCCCTTGGAA7ACTCCCCTGCTACCGGTTA - ^ o 
leGlnGlnGlyIi e LeuValProValGlnSer?roTr P A 5 nThrProL^i: A ^c5II A 

2821 rSvS"S GA K CAAT f T I AT r G ACCAG ' ACAGG ^- ^AGAGAGGTCAATAAAAGGG 2860 
rgLysProGlyThrAsnAspTyrArgProValGlrAscLeuArgGiuVa.AsnLysArgV 

2 8 81 TGCAGGACATACACCCAACGGTCCCGAACCC7TATAACCTC7TGAGCGCCCTCCCGCCTG - 9 4 - 

2 54 1 AACGGAAC7GG7ACACAG7A77GGACT7AAAAGATGCC77C77C7GCC7GAGA77^CA- 3000 
lu^rgA S n7r P Tyr7nrValLeuA S p^uLy S A S pAlaPr.*PheCy S LeuArgI; A H A :; 



^GC7CAAA 2 4 60 



2520 
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3001 CCACTAGCCAACCACTTTTTGCCTTCGAATGGAGAGATCCAGGTACGGGAAGAACC3GGC 3060 
rShrSerGlnProLeuPheAlaPheGluTrpArgAspProGlyThrGlyArgThrG.yG 

3061 AGCTCACCTGGACCCGACTGCCCCAAGGGTTCAAGAACTCCCCGACCATCTTTGACGAAG 312 0 
3 SShSr^hrLgLeuProGlnGlyPheLysAsnSerProThrllePheAspGluA 

3171 rcCTACACAGGGACCTGGCCAACTTCAGGATCCAACACCCTCAGGTGACCCTCCTCCAGT 319 0 
3121 ^L^Sg2p2uAlaAsnPheAr g lleGlnHi S ProGlnValThrLeul.euGlnT 

?iai ACGTGGATGACCTGCTTCTGGCGGGAGCCACCAAACAGGACTGCTTAGAAGGTACGAAGG 324 0 

3241 CACTACTGCTGGAATTGTCTGACCTAGGCTACAGAGCCTCTGCTAAGAAGGCCCAGATTT 3 300 
l,L.uL.uLeuGluLeuSecA S pLeuClyTyrArgAl«SerAlaLysLysAl*Gln..eC 

3301 GCAGGAGAGAGGTAACATACTTGGGGTACAGTTTGCGGGGCGGGCAGCGATGGCTGACGG 3360 
ysArgArgGluValThrTyrLeuGlyTyrSerieuArgGlyGlyGloArgTrpLeu.r.rG 

,,,, AGGCACGGAAGAAAACTGTAGTCCAGATACCGGCCCCAACCACAGCCAAACAAGTGAGAG 3 420 
3361 ^aArg^^h^IwalGlnlleProAlaProThrThrAlaLysGlnValArgG 

3,21 AGTTTTTGGGGACAGCTGGATTTTGCAGACTGTGGATCCCGGGGTTTGCGAC^ 3480 
luPheLeuGlyThrAlaGlyPheCysArgLeuTrpIlePtoGlyPheAlaThrLeuA^aA 

3481 CCCCACTCTACCCGCTAACCAAAGAAAAAGGGGGATTCTCCTGGGCTCCTGAGC 3540 
laProLeuTyrProLeuThrLysGluLysGlyGlyPheSerTrpAlaProGluHxsG.nL 

3541 AGGCATTTGATGCTATCAAAAAGGCCCTGCTGAGCGCACCTGCTCTGGCCCTCCCT^ 3600 
ysAlaPheAspAlalleLysLysAlaLeuLeuSerAxaProAlaL^uAla^uPr-As.V 

3601 TAACTAAACCCTTTACCCTTTATGTGGATGAGCGT ^f^f ^^TT;^ 3 * ' ° 
alThrLysProPheThrLeuTyrValAspGluArg'-ysGlyValAlaArgGlyVal-euT 

3661 CCCAAACCCTAGGACCATGGAGGAGACCTGTTGCCTACCTGTCAAAGAAGCTTGATCCT3 3720 
HrGlnThrLeuGlyProTrpArgArgProValAlaTyELeuSerLysLysLeuAspProV 

37 21 TAGCCAGTGGTTGGCCCGTATGTCTGAAGGCTATCGCAGCTGTGGCCATACTGGTCAAGG 37 6 0 

alAlaSerGlyTrpProValCysLeuLysAlalleAlaAlaValAialleLeuVaiLysA 

3761 ACGCTGACAAATTGACTTTGGGACAGAATATAACTGTAATAGCCCCCCATGCATTGGA 3840 
spAlaAspLysLeuThtLeuGlyGlnAsnlleThrVallleAlaProHisAiaLeuGx.-A 

38 4^ ACATCGTTCGGCAGCCCCCAGACCGATGGATGACCAACGCCCGCATGACCCACTATCAAA 3 900 

snIleV.lAegGlnPtoPteAspArgTrpMetThrA5nAl.ArgMetThrHi5TyrGlr. 3 

3*01 GCCTGCTTCTCACAGAGAGGGTCACTTTCGCTCCACCAGCCGCTCTCAACCCTGCCACTG 3960 
erLeuLeuLeuThrGluArgVaiThrPheAlaProProAlaAla.eunsnProAlaTh.^ 
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3961 TTCTGCCTGAAGAGACTGATGAACCAGTGACTCATGATTGCCATCAACTATTGATTGAGG 4 02 0 
euLeuProGluGluThrAspGluProValThrHisAspCysHisGlnLeuLeuIleGluG 

4021 AGACTGGGGTCCGCAAGGACCTTACAGACATACCGCTGACTGGAGAAGTGCTAACCTGGT 4 080 
luThrGlyVaiArgLysAspLeuThrAspIleProLeuThrGlyGluValLeuThrTrpP 

4 081 TCACTGACGGAAGCAGCTATGTGGTGGAAGGTAAGAGGATGGCTGGGGCGGCAGTGGTGG 414 0 
h*ThrAspGlySerSerTyrValValGluGlyLysArgM«tAlaGiyAlaAlaValValA 

4141 ACGGGACCCGCACGATCTGGGCCAGCAGCCTGCCGGAAGGAACTTCAGCGCAAAAGGCTG 42 00 
spGlyThrArgThrlleTrpAlaSerSerLeuProGluGlyThrSerAlaGlnLysAlaG 

42 01 AGCTCATGGCCCTCACGCAAGCTTTGCGGCTGGCCGAAGGGAAATCCATAAACATTTATA 4 2 60 
luLeuMecAlaLeuThrGloAlaL^uArgLeuAlaGluGlyLysSerlleAsnlleTyrT 

42 61 CGGACAGCAGGTATGCCTTTGCGACTGCACACGTACACGGGGCCATCTATAA^CAAAGGG 4 32 0 
^AspSerArgTyrAlaPheAlaThrAlaHisValH^sGlyAlalleTyrLysGlnArgG 

4 321 GGTTGCTTACCTCAGCAGGGAGGGAAATAAAGAACAAAGAGGAAATTCTAAGCCTATTAG 4 3S0 
lyLeuLeuThrSerAlaGlyArgGluIleLysAsnLysGluGluIieLeuSerLeuLeuG 

4 381 AAGCCTTACATTTGCCAAAAAGGCTAGCTATTATACACTGTCCTGGACATCAGAAAGCCA 44 4 0 
luAlaLeuHisLeuProLysArgLeuAlallelleHisCysProGlyHisGInLysAlaL 

4 4 41 AAGATCTCATATCTAGAGGGAACCAGATGGCTGACCGGGTTGCCAAGCAGGCAGCCCAGG 4 5 00 
ysAspLeuIleSetArgGlyAsnGinMeCAlaAspArgValAlaLysGlnAlaAlaGlnA 

4 5 01 CTGTTAACCTTCTGCCTATAATAGAAACGCCCAAAGCCCCAGAACCCAGACGACAGTACA 4 560 
laValAsnLeuLeuProIlelieGluThrProLysAlaProGluProArgArgGlnTyrT 

4561 CCCTAGAAGACTGGCAAGAGATAAAAAAGATAGACCAGTTCTCTGAGACTCCGGAGGGGA 4 62 0 
hrLeuGluA^pTrpGlnGluIleLysLysUeAspGlnPheSerGluThrPrcGluGlyT 

4621 ^c^™™"!^™^ 5 ^ 2 ^ 4 66 0 

nrCysTyrThrSerTyrGlyLysGluIleLeuPro.HxsLysGluGlyLeuGl -yrValG 

4681 ^^T^^f 4 7 4 0 

InGlnlleKisArgLeuThrHisLeuGlyThrLysHasLeuGlnGlnLeuVa.ArgThrS 

4 74 1 CCCCTTATCATGTTCTGAGGCTACCAGGAGTGGCTGACTCGGTGGTCAAACATTGTGTGC 4 8 00 
erProTyrHlsValLeuArgLeuProGlyVaiAlaAspSerValValLysHisCysValP 

4 8 C 1 CCTGCCAGCTGGTTAATGCT.AATCCTTCCAGAArACCTCCAGGAAAGAGACT.A^GGGGAA 4 8 6 0 
roCysGlnLeuValAsnAlaAsnProSerArglleProProGly^vsArgLe^rgGlys' 

4 6 6 : ^ CAC p CAGGCGCTGAC ; GGGAAGTGGACT ' CA ^GAGGTAAAGCCGGCT^ , 52 n 

erHisPcoG.yA.aKisTcpG.^VaiAspPheThrGiuValLysPsoAlaLysTytGlyA 
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4 921 ACAAATA7CTATTGGTTTT7G7AGACACCTTTTCAGGA7GGG7AGAGGC77A7CC7AC7A 4 9 8 0 
snLysTycLeuLeuValPheValAspThrPheSerGlyTrpValGluAlaTyrProThrl 

4 981 AGAAAGAGACT7CAACCGTGGTGGCTAAGAAAATAC7GGAGGAAA7TT7TCCAAGAT77G 504 0 
ysLysGluThrSerThrValValAlaLysLysIleLeuGluGluIlePheProArgPheG 

5041 GAATACCTAAGGTAATAGGGTCAGACAATGGTCCAGCTTTCGTTGCCCAGGTAAGTCAGG 5100 
4 lylleProLysVallleGlySerAspAsnGlyProAlaPheValAlaGlnValSerGlnG 

5101 GACTGGCCAAGA7ATTGGGGATTGA7TGGAAACTGCATTG7GCATACAGACCCCAAAGCT 5160 
lyLcuAlaLysIleLeuGlylleAspTrpLysLeuHxsCysAlaTyrArgProGlnSerS 

5161 C AG GAC AG GT AG AGAGG AT G AA7 AG AAC CATT AAAG AGA C C CTT AC C AAATT G AC C AC AG 522 0 
erGlyGlnValGluArgMetAsnArgThrlleLysGluThrLeuThrLysLeuThrThrG 

52 2 1 AGACTGGCAT7AATGAT7GGATGGCTC7CC7GCCCTT7G7GCTTTTTAGGGTGAGGAACA 52 3 0 
luThrGIylleAsnAspTrpMecAlaLeuLeuProPheVaiLeuPheArgValArgAsnT 

5261 CCCCTGGACAGTTTGGGCTGACCCCCTATGAATTGCTCTACGGGGGACCCCCCCCGTTGG 5 34 0 
hrProGlyGlnPheGlyLeuThrProTyrGluLeuLauTyrGlyGlyProProProLeuA 

5341 CAGAAATTGCCTTTGCACATAGTGC7GATGTGCTGCTTTCCCAGCCTTTGTTCTCTAGGC 54 00 
laGluIleAla'PheAlaHisSerAlaAspValLeuLeuSecGlnProLeuPheSerArgL 

5401 TCAAGGCGCTCGAGTGGGTGAGGCAGCGAGCGTGGAAGCAGCTCCGGGAGGCCTACTCAG 54 60 
euLysAlaLeuGluTrpValArgGlnArgAlaTrpLysGlnLeuArgGluAlaTyrSerG 

5461 GAGGAGAC7TGCAAGT7CCACA7CGCT7CCAAGT7GGAGAT7CAGTC7ATG77AGACGCC 5 52 0 
lyGlyAspLeuGlnValProHisArgPheGlnValGlyAspS^rValTyrValArgArgH 



5521 ACCGTGCAGGAAACCTCGAGACTCGGTGGAAGGGACCTTATCTCGTACTTTTGACCACAC 5 5 8 0 
isArgAlaGlyAsnLeuGluThrArgTrpLysGlyPrcTyrLeuValLeuLeuThrThrP 

5 581 CAACGGCTGTGAAAGTCGAAGGAATCCCCACCTGGATCCATGCATCCCACGTTAAGCCGG 56 4 0 
roThrAiaValLysValGluGlyllePcoThrTrpIleKisAlaSerHisValLysProA 

MecKisPrcThrLeuSerArg 

5641 CGCCACCTCCCGAC7CGGGGTGGAGAGCCGAAAAGAcTGAGAATCCCC77AAGC7TCGCC 57 00 
laProProProAsoSerGiyTrpArgAlaGlcLysThrGi fc :AsnP roLe'jLys LeuArgL 
ArgHisLeuPrcThrArgGlyGlyGluPrcLysArgLeuArglieProLeuSerPheAla 

57 01 TCCATCGCCTGGTTCCTTACTCTAACAMAACTCCCCAGGCCAGTAGTAAACGCC7TATA 5 7 60 
euHisArgLeuVai?roTyrSerAsr\AsriAsnSer P roGI yGi nLnd 
5erIleAlaTrp?heLeuThrLeuThrIleThrProGln.MaSerSerLysArgLeuIle 

57 61 GACAGCTCGAACCCCCA7AGACC7TTATCCC7TACC7GGC7GATTA77GACCC7GATACG 5 e 2 0 
AspSerSerAsn?:cHisArgProLeuSerLeuThrTr?UuI lelleAspProAspThr 
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5 821 GGTGTCACTGTAAATAGCACTCGAGGTGTTGCTCC7AGAGGCACCTGGTGGCCTGAACT3 58 3 0 
GiyValThrValAsnSerThrArgGlyValAlaProArgGlyThrTrpTrpProGluLeu 

5881 CATTTCTGCCTCCGATTGATTAACCCCGCTGTTAAAAGCACACCTCCCAACCTAGTCCGT 594 0 
HisPheCysLeuArgLeuIleAsnProAlaValLysSerThrFroProAsnLeuValArg 

5941 AGTTATGGGTTCTATTGCTGCCCAGGCACAGAGAAAGAGAAATACTGTGGGGGTT CTGGG 60 0 0 
SerTyrGiyPheTyrCysCysProGlyThrGluLysGluLysTyrCysGlyGlySerGIy 

6001 GAATCCTTCTGTAGGAGATGGAGCTGCGTCACCTCCAACGATGGAGACTGGAAATGGCCG 6060 
GluSerPheCysArgArgTrpSerCysValThrSerAsnAspGlyAspTrpLysTrpPro 

6061 A7CTCTCTCCAGGACCGGGTAAAATTCTCCTTTGTCAATTCCGGCCCGGGCAAGTACAAA 61*0 
IleSerLeuGlnAspArgVallysPheSerPheValAsnSerGlyProGlyLysTyrLys 

612 1 AT GAT GAAACT ATATAAAGATAAGAGCT G CT C C C CAT CAG ACTTAGATT ATCT AAAG AT A 618 0 
MetMetLysLeuTyrLysAspLysSerCysSerProSerAspLeuAspTyrLeuLysIle 

6181 AGTTTCACTGAAAGGAAAACAGGAAAATATTCAAAAGTGGATAAATGGTAT GAGCTGGGG 62 4 0 
SerPheThrGluArgLysThrGlyLysTyr5erLysValAspLysTrpTyrGluLeuGly 

62 41 AATAGTTTTTTATTATATGGCGGGGGAGCAGGGTCCACTTTAACCATTCGCCTTAGGATA 6 3 0 0 
Asr.SerPheLeuLeuTyrGlyGIyGlyAlaGlySerThrLeuThilleArgLeuAiglle 

6 3 01 6AGACGGGGACAGAACCCCCTGTGGCAATGGGACCCGATAAAGTACTGGCTGAACAGGGG 6 3 6 0 
GluThrGlyThrGluProProValAlaMetGlyProAspLysValLeuAlaGluGlnGly 

6361 CCCCCGGCCCTGGAGCCACCGCATAACTTGCCGGTGCCCCAATTAACCTCGCTGCGGCC- 64^0 
Pro?roAlaLeuGluProProHisAsnLeuProVa:?roGinLeuThr5erLeuArg?rc 

6421 GACATAACACAGCCGCCTAGCAACAGTAC CACTGGATTGATTCCTACCAACACGCCT^vGA 64 8 0 
AspIleThrGlnProProSerAsnSerThrThrGly-euIleProThrAsnThrProArg 

6481 AACTCCCCAGGTGTTCCTGTTAAGACAGGACAGAGACTCTTCAGTCTCATCCAGGGAGCT 654 0 
AsnSerProGlyValProValLysThrGlyGlnArgLeuPheSerLeuIleGlnGlyAla 

6541 TTCC^GCCATCAACTCCACCGACCCTGATGCCACTTCTTCT-GTTGGCTTTGTCTATCC 6600 
PheGlnAxalleAsnSerThrAspProAspAiaThrScrSerCysTrpLeuCysLeuSe: 

6601 TCAGGGCCTCCTTATTATGAGGGGATGGCTAAAGAAAGAAAATTCAATGTGACCAAAGAG 6 660 
SerG.yProProTyrTyrGluGlyMetAlaLysGluArgUsPheAsnValThrlysGlu 

6 661 CATAGAAATCAATGTACATGGGGGTCCCGAAATAAGC7TACCCTCACTGAAGTTTCCGGG 6 72 C 
KisArgAsnGlnCysThrTrpGlySe=ArgAsnLysLe:jThrLeuTnrGluValSerGlv 

6721 AAGGGGAGATGCATAGGAAAAGCTCCCCCATCCCA7CAACA77777GCTATAGTACTG7G i 7 8 0 
LysGl yTh rCys I leGl yLysAl a P rop rcSerHisGl nh:s Le jCysTy r S e rTnrVa ; 
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67 81 GTTTATGAGCAGGCCTCAGAAAATCAGTATTTAGTACCTGGTTATAACAGGTGGTGGGCA 6 8 4 0 
ValTyrGluGlnAlaSerGluAsnGlnTyrLeuValProGlyTyrAsnArgTrpTrpAla 

684 1 T GC AAT ACT GG GTTAACC CCCT GTGTTT C CAC CT CAGT CTT C AAC C AAT C C AAAGATTT C 69 00 
CysAsnThrGlyLeuThrProCysValSerThrSerValPheAsnGlnSerLysAspPhe 

6901 TGTGTCATGGTCCAAATCGTCCCCCGAGTGTACTACCATCCT GAGGAAGTGGTCCTTGAT 69 60 
CysValMetValGlnlleValProArgValTyrTyrHisProGluGluValValLeuAsp 

6961 GAATAT GACT AT CGGTATAAC CG AC CAAAAAG AGAAC C C GT AT CC CTT AC C CT AGCT GTA 7 02 0 
GluTyrAspTyrArgTyrAsnArgProLysArgGluProValSerLeuThrLeuAlaVal 

7 021 ATGCTCGGATTAGGGACGGCCGTTGGCGTAGGAACAGGGACAGCTGCCCTGATCACAGGA 70 8 0 
MetLeuGlyLeuGlyThrAlaValGlyValGlyThrGlyThrAlaAlaLealleThrGly 

7 081 CCACAGCAGCTAGAGAAAGGACTTGGTGAGCTACATGCGGCCAT GACAGAAGATCTCCGA 7 14 0 
ProGlnGlnLeuGluLys Gl y LeuGl yGl uLeuHisAlaAl aMe tThrGluAspLeuAr g 

7141 GCCTTAAAGGAGTCTGTTAGCAACCTAGAAGAGTCCCTGACTTCTTTGTCTGAAGTGGTT 72 00 
AlaLeuLysGluSerValSerAsnLeuGluGluSerLeuThrSerLeuSerGluValVal 

7 2 01 CTACAGAACCGGAGGGGATTAGATCTGCTGTTTCTAAGAGAAGGTGGGTTATGTGCAGCC 7 2 60 
LeuGlnAsnArgArgGlyLeuAspLeuLeuPheLeuArgGluGlyGlyLeuCysAiaAla 

72 61 TTAAAAG AAGAAT GTT G CTTCT AT GT AG AT C ACT CAGG AG C CAT CAG AGAC T C CAT G AAC 7 32 0 
LeuLysGluGluCysCysPheTyrValAspHisSerGlyAlalieArgAspSerMetAsn 

7 321 AAGCTTAGAAAAAAGTTAGAGAGGCGTCGAAGGGAAAGAGAGGCTGACCAGGGGTGGTTT 7 3 8 0 
LysLeuArgLysLysLeuGluArgArgArqArgGluArgGluAiaAspGlnGlyTrpPhe 

7 38 1 GAAGGATGGTTCAACAGGTCTCCTTGGATGACCACCCTGCTTTCTGCTCTGACGGGGCCC 7 4 4 0 
GluGlyTrpPheAsoArgSer ProTrpMetThrThrLeuLeuSerAiaLeuThrGlyPro 

7 4 41 CTAGTAGTCCTGCTCCTGTTACTTACAGTTGGGCCTTGCTTAATTAATAGGTTTGTTGCC 7 5 0 0 
LeuValValLeuLeuLeuLeuLeuThrValGlyProCysLeuIIeAsnArgPheValAla 

7 5 C 1 TTTGTTAGAGAACGAGTGAGTGCAGTCCAGATCATGGTACTTAGGCAACAGTACCAAGGC 7 56 0 
PheValArgGluArgValSerAlaValGlnlleMetValLeuArgGinGlnTyrGlnGly 

7 5 61 CTTCTGAGCCAAGGAGAAACTGACCTCTAGCCTTCCCAGTTCTAAGATTAGAACTATTAA 7 62 0 
LeuLeuSerGlnGlyGluThrAspLeuLnd 

7 621 CAAGACAAGAAGTGGGGAATGAAAGGAT GA : kAAT GC AACC7 AAC CCTCCCAGAACC CAGG 7 6 8 0 

7 66 1 AAGTTAATAAAAAGCTCTAAATGCCCCCGAATTACAGACCCTGCTGGCTGCCAGTAAATA 77 4 0 
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77 41 GGTAGAAGGTCACACTTCCTATTGTTCCAGGGCCTGCTATCCTGGCCTAAGTAAGATAAC 7 8 0 0 

7 8 01 AGGAAATGAGTTGACTAATCGCTTATCTGGATTCTGTAAAACTGACTGGCACCATAGAAG 7 8 60 

7 8 61 AATTGATTACACATTGACAGCCCTAGTGACCTATCTCAACTGCAATCTGTCACTCTGCCC 7 92 0 

7 921 AGGAGCCCACGCAGATGCGGACCTCCGGAGCTATTTTAAAATGATTGGTCCACGGAGCGC 7 98 0 

7 981 GGGCTCTCGATATTTTAAAATGATTGGTCCATGGAGCGCGGGCTCTCGATATTTTAAAAT 8 04 0 

8 041 GA7TGGTTTGTGACGCACAGGCTTTGTTGTGAACCCCATAAAAGC7GTCCCGATTCCGCA 810 0 
8101 CTCGGGGCCGCAGTCCTCTACCCCTGCGTGGTGTACGACTGTGGGCCCCAGCGCGCTTGG 8160 
8161 AAT AAAAATCCT CT T G C T GTTTG C AT C AAAAAAAAAAAAAAAAAAAAAA 82 0 9 
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The same nucleotide sequence as represented by bases 5260 to 3210 
in Figure 3 is also representative for this Figure, v-^ rh^ 
following changes: 



Position 


Char.ee 




5273 


G — T 




5341 


C-T 




5351 


C-T 




5353 


T-C 




5 3 5 6 


C-T 




5426 


G-A 




5464 


Insertion 


AG A 


5607 


C-T 




5633 


C-T 




5792 


T-C 




6191 


Insertion 


AA 


6253 


T-A 




6255 


Insertion 


A 


6900 


C-G 





Such nucleotide changes result in 
changes in the ENV polypeptide. 



relieving ar.inc 



a ci c 



Position 
7 

192 
193 
194 
197 
193 
199 
200 
2 01 
204 
205 
206 
206 
2 03 
209 
211 
212 
427 



Change 

R-W 

R-K 

Deletion 

Deletion 

V-Q 

S-E 

K-N 

V-I 

D-Q 

Y-I 

E-N 

Insertion; 

L-W 

N-I 

s-v 

L-V 
L-K 
F-L 
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SFV-3 
SFV-1 v .52 5 
HSRV 
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PoEV 

GALV , j BaEV 
' FeLV 




EIAV 



HIV- 



MMTV MPMV 



RSV 



1C% 



.MuLV murine leukaem.a virus 

FeLV feline leukaemia virus 

GaLV gibbon ape leukaemia v:rus 

SVV-1 simian foamy virus 1 

SFV-3 simian foamy virus 3 

r-.SRV human fcamy virus 

ouV Eovme leukaemia virus 

HTLV human T-cell leukaemia virus 

MM TV murine mammary tumour virus 

MPMV Mason Pficer monkey virus 

P.S'V Rous sarcoma virus 

HV felme immunodeficiency virus 

HIV human immuncdeficiencv virus 

= :AV scume infectious anaemia virus 
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AP-t 

PTS-1 -T^xr- ^ GACCCTGCTG 

- i - -in^Tr 1 ^ vp-J^T GC^. 1 -— — — w a — ■ 

nf-1 ;^. : . r ,.-T^.-^Ks:::s 

101 j CTg CC.-.GT^-----T_.-.C--- • : :-— * 

£73-1 /GAT A ~ - ^C-.-.C-TTG- - — ^--CTTAT 

, - 1 g^^^ TGTaja acTCacTGGcy cAT^^^- * 



GATA -» C r^lt^^i---->c-CGCGGGCTCr 

CATA CCAAT<- AP-1/CRS3 

4 C1 CG^TTT?J^ TCSAT^T7^^GwA^..-v. _ 

TATA _ _JT_ r^C^G-CCTCTAGCCCTG 

4 3 1 r^JAGCTGTCCCGATTCCGCAw . — --^ C -^ ^ * ~ 

poly A 

PADS ^ , — — zl-_-"CCCTTC- 

5 CI C 3TGG T GT.-. C G---CT GTGGGGC — ----- — 

U5 ! P9S 

SCI C C GAKC C CGGAC G.P--GGGGG-VT TGTTCTTT TACT G G CCT77CATTCG.3TGJI 
551 GT T G GC Z G GGAAAC CCT GCGAGC 



ETS- 



CCAAT 
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